Foreclosure with Incomplete Information LUCY WHITE Harvard Business School Baker Library 345, Soldiers Field Road Boston, MA 02163, USA [email protected] and Swiss Finance Institute and DEEP, Ecole des HEC Université de Lausanne Bureau 553, Internef, 1015 Dorigny Lusanne, Switzerland [email protected] We investigate the robustness of the new foreclosure doctrine and its associated welfare implications to the introduction of incomplete information. In particular, we let the upstream firm’s marginal cost be private information, unknown to the downstream firms. The previous literature has argued that vertical integration is harmful because it allows an upstream monopolist to limit output to monopoly levels, whereas a disintegrated structure will “oversell,” producing more in equilibrium. By contrast, we find that with incomplete information, high-cost firms will often “under-sell” in equilibrium, that is, supply less than their monopoly output. Low-cost firms continue to over-sell, so all types of firms have a reason to integrate downstream, but this is socially harmful only for low-cost types. For high-cost firms vertical integration can be Pareto-improving, resulting in higher output, profits, and consumer surplus. 1. Introduction Foreclosure may be defined as a dominant firm’s practice of inhibiting one or more (downstream) firms’ access to an essential input, which it supplies. The Chicago School (Bork, 1978; Posner, 1979) argued that such foreclosure cannot be profitable because the upstream firm cannot extend its market power by monopolizing the downstream market. Loosely speaking, there is only one monopoly profit to be earned in Much of this work was completed although the author was at GREMAQ, Université des Sciences Sociales, Toulouse and Nuffield College, Oxford. The author thanks an anonymous referee and an Associate Editor for very useful comments; as well as Liam Brunt, Paul Klemperer, Kai-Uwe Kühn, Catalina Martinez, Felix Ng, Patrick Rey, and Jean Tirole, seminar participants in Oxford, M.I.T. and the 1999 Meeting of the European Economic Association for helpful comments and suggestions. C 2007, The Author(s) C 2007 Blackwell Publishing Journal Compilation Journal of Economics & Management Strategy, Volume 16, Number 2, Summer 2007, 507–535 508 Journal of Economics & Management Strategy any given industry, and an upstream monopoly position is already sufficient to extract this: there is no need to interfere with competition downstream. More recently, a strand of the literature, which (for the sake of supplying a label) we will call the new foreclosure doctrine, has suggested that this argument is flawed because when there is competition downstream it can be difficult for the upstream firm to commit to limiting output to monopoly levels. Then vertical integration by a dominant upstream firm is harmful because it can lead to foreclosure of the unintegrated downstream firm(s), and thence reduced output. This more negative view of vertical mergers has been based on a series of complete information models (see, for example, the seminal paper by Hart and Tirole, 1990; McAfee and Schwartz, 1994, and the recent survey by Rey and Tirole, 2005). In this paper, we examine the robustness of the results of this literature to the introduction of incomplete information about upstream firm costs. This exercise leads to rather different policy conclusions. We show that whilst the new foreclosure doctrine continues to hold for low-cost firms, vertically disintegrated high-cost firms have signaling incentives to restrict output. These incentives do not arise in complete information models. Moreover, these incentives disappear—leading to a potential expansion of output—if vertical merger (or exclusive dealing) is allowed. Thus, the effect of allowing such vertical restraints on the output of high-cost types is ambiguous. In some cases, vertical merger will increase profits, output, and welfare. Thus, the current analysis suggests that antitrust authorities should adopt a less hostile approach to such vertical restraints than that implied by the new foreclosure doctrine. Introducing incomplete information into a foreclosure model is thus not just a simple robustness check, but yields different policy conclusions. There are two other reasons for pursuing this line of enquiry. The first reason is methodological. The reader may recall that the results of the new foreclosure doctrine depend on the assumption that downstream firms have passive conjectures about upstream firm sales when the latter makes offers to them. That is to say that these results depend on the assumption that when a downstream firm receives an out-of-equilibrium offer from the upstream monopolist, it continues to believe that its downstream rivals receive their on-the-equilibrium path offers. By contrast, if downstream firms instead have symmetric conjectures—that is, they believe that their downstream rivals always receive exactly the same offer as themselves, whether or not the offer was the expected one—then there is no need for firms to vertically integrate because the monopoly outcome is sustainable under vertical separation. Although passive conjectures seem generally more reasonable, one may Foreclosure with Incomplete Information 509 feel a little uneasy about the choice because both forms of conjecture constitute perfect Baysesian equilibria. It had been thought that introducing incomplete information might offer a solution to this dilemma.1 Unfortunately, we find that, at least in the particular game studied here, the introduction of asymmetric information does not offer the promised solution: one must still make an assumption about downstream firms’ off-the-equilibrium-path beliefs about the behavior of a given type of upstream firm in selling to his rival. But, arguably, the introduction of incomplete information does have the more limited benefit of making what amounts to an assumption of passive conjectures feel somewhat more natural by allowing downstream firms to update their beliefs if the unexpected change in output is large enough.2 The second, more concrete, advantage of the introduction of incomplete information is that the situation where upstream firms have more information about their costs than downstream firms is that presumably this is frequently the empirically relevant case.3 This is especially interesting as the majority of empirical studies show that vertical integration is associated with lower prices (see Lafontaine and Slade, 1997). This makes it difficult for antitrust policy makers to give too much weight to the new theory because according to the new foreclosure doctrine, vertical integration, when it forecloses unintegrated downstream firms, should be associated with higher prices 1. See, e.g., McAfee and Schwartz, 1994, p. (225), who suggest that “In order to make serious headway on these contracting problems it will be necessary to introduce asymmetric information explicitly . . . .” 2. For an alternative approach to resolving the difficulties surrounding the choice of conjectures in this and more general games, see Segal and Whinston (2003). Their work is related to ours in that they use menus of contracts to allow a retailer to distinguish ex post between suppliers according to what offers they have made to his rival. They do not consider incomplete information per se, and their aim is different: they wish to establish bounds on possible output levels and to consider how these converge as the number of retailers becomes very large. They do not consider the issue of vertical integration or its welfare effects. The problem of conjectures is circumvented by Riordan (1998) by simply assuming that one (dominant) downstream firm moves first and his moves are observed by the others. Chen (2001), on the other hand, examines the effects of vertical merger when there are multiple upstream firms one of which must be (weakly) more efficient than the others, and the offers made by upstream firms are public information. 3. For other (complete information) theories of foreclosure which focus on upstream costs of supplying downstream firms, see Choi and Yi (2000) and Church and Gandal (2000). In these papers, costs are perfectly known, but are endogenous to choices about input specificity or compatibility made by upstream firms. Ma (1997) considers the case where downstream firms write contracts with consumers to supply options on upstream goods as in for example the market for health insurance. This line of literature, starting with Ordover Saloner, and Salop (1990) is theoretically somewhat distinct from that considered in this paper in that the anti-competitive effects of vertical merger arise even when upstream firms can make public offers, but generally require the presence of a competing upstream firm and a friction which means that an integrated firm will be unable or reluctant to sell to unintegrated downstream firms. 510 Journal of Economics & Management Strategy (Cooper et al., 2005). Of course, no one ever claimed that foreclosure was the only reason for vertical integration; many and perhaps most vertical mergers may be expected to result in efficiency enhancements which reduce costs and hence prices. For example, vertical integration might lower transaction costs (Williamson, 1975; 1985) or eliminate double marginalization (Spengler, 1950) and this may explain why in statistical studies vertical integration seems to reduce prices.4 Perhaps more problematic for the emerging literature is the surprising result that even if one restricts attention to studies of industries where the government has intervened to outlaw specific vertical restraints—prima facie a more favorable testing ground for new foreclosure theories—one finds that the supposedly procompetitive government policy has tended to raise prices (Lafontaine and Slade, 2005). On the other hand, the traditional explanations of the efficiency gains arising from vertical integration mentioned above face a theoretical difficulty, in that it is never entirely clear why these efficiencies cannot be obtained through contractual means without vertical integration. For example, offering two-part tariffs would seem to be a much less costly solution to the problem of double marginalization than vertical integration.5 In this paper, we demonstrate the existence of what can be seen as another type of “efficiency enhancement” associated with vertical integration: the removal of the need to engage in costly signaling. We show that when there is incomplete information about costs, vertical integration can easily result in high-cost types selling to downstream firms at lower prices than they would if vertically separated. Vertically integrated low-cost types will sell at higher retail prices. Thus it is possible that, consistently with the empirical literature, prices will fall “on average” if high-cost types occur sufficiently frequently compared to low cost types. We provide a numerical example in the Appendix where this is the case. In finding that vertical integration can reduce prices and raise welfare, our paper is related to Baake, Kamecke, and Normann (2004). They consider a model very similar to that used here, but they allow the upstream monopolist to publicly choose “capacity” before making secret contract offers to downstream firms. By publicly committing to a capacity which is very low, the upstream firm commits to having high marginal costs, and thus to low output. Thus, the monopolist underinvests in capacity as a way of solving his commitment problem; 4. For surveys of other reasons why vertical integration and other vertical restraints might arise, see Perry (1987) and Katz (1987). More recent perspectives on theory and policy are provided by Hovenkamp (2001), Klass and Salinger (1995), Rey and Tirole (2005), and Riordan and Salop (1995) (see also the comment by Reiffen and Vita, 1995). 5. See Whinston (2003) for a recent critique of the transactions cost theory. Foreclosure with Incomplete Information 511 the cost of this is that ex post he uses too little capacity and too much of the variable input, that is, he does not minimize the cost of producing his chosen output. This implies that, under some conditions, vertical integration, which leads to monopoly with efficient production, can be socially preferable to vertical separation. The broad logic behind this policy conclusion—that the upstream firm pursues a “puppy dog” strategy to gain some commitment not to overproduce—is similar to that in the present paper.6 The specifics, however, are very different. The Baake, Kamecke, and Normann result relies on the upstream firm being able to publicly commit to having high marginal costs; whereas on the contrary, the result in the present paper relies on the upstream firm’s marginal cost being unknown. Once the upstream firm’s marginal cost becomes common knowledge, the signaling incentives, which sustain the commitment to low output, are destroyed, as we now explain. The reason why introducing incomplete information about upstream costs makes such a difference to the standard model is relatively simple and intuitive. In the complete information model, each of the downstream firms constitutes an entirely separate market from the point of view of the upstream firm. When downstream firms do not observe each others’ orders, the upstream firm’s profit from selling to one downstream firm is independent of how much it actually sells to the other downstream firm(s) (it depends only on how much it is expected to sell to it in equilibrium). Therefore the upstream firm has no way of credibly signaling to one downstream how much it is selling to the other(s), so downstream firms fear opportunistic behavior by the upstream firm. The only way they can be sure that the quantities sold to other firms will be as anticipated is if all quantities are best responses to one another, so that there is no incentive to deviate from expectations. This implies Cournot output when the strategic variable is quantity.7 By contrast, when upstream costs are unknown, a low quantity offer may be interpreted as indicating high costs, and hence low quantity offers to other downstream firms as well. This creates signaling incentives for all cost-types to reduce output that do not exist in the complete information model.8 Under incomplete information, there is a tendency for output to be distorted downwards under vertical separation, which can be alleviated by vertical integration. 6. I thank an insightful referee for bringing this to my attention. 7. As remarked above, this argument depends on the assumption of passive conjectures, and will be explained in Section 2 below. The case of price competition downstream has been considered by O’Brien and Shaffer (1992), yielding similar results; though in this case the analysis is considerably more sensitive to the choice of equilibrium concept (see Rey and Vergé, 2004). 8. Incomplete information has been introduced into the related model of the durable goods monopolist by Ausubel and Deneckere (1992). 512 Journal of Economics & Management Strategy Of course, the problem with introducing incomplete information to the model is as usual as that the possibility of signaling leads to a multiplicity of equilibria, some of which are very inefficient, for example, equilibria in which all firm types produce less than their monopoly output. Clearly, some mechanism is required to eliminate equilibria that are supported by “unreasonable” beliefs. Since this is a model of an informed principal, we use the device suggested by Maskin and Tirole (1992) to alleviate this problem. We will focus on the Cho–Kreps leastcost signaling equilibrium. In this equilibrium, the lowest cost types have output that is undistorted relative to complete information case (i.e., Cournot output). The high-cost types have output low enough that the low-cost types do not want to imitate them—so if costs are similar enough, high-cost types must produce less than their own monopoly output. In consequence, policies that allow the monopolist to achieve his monopoly level of output—such as vertical integration, exclusive dealing, or nondiscrimination clauses9 —improve output, profits, and welfare for high-cost types, and reduce welfare for low-cost types. The plan of the paper is as follows. In Section 2, we begin by setting out the complete information benchmark for those unfamiliar with the new foreclosure doctrine.10 Section 3 sets out and introduces our model of foreclosure with incomplete information about upstream costs. We explain our approach in dealing with the problem of multiple equilibria that arises. We will use the “market-by-market” assumption to select off-the-equilibrium path conjectures about the actions of a given type, and solve the signaling game as an informed principal problem using the device suggested by Maskin and Tirole (1992). In Section 4, we discuss the properties of the least-cost separating equilibrium of the signaling game, which is the one selected by this approach. In Section 5, we close the model by requiring that beliefs be correct in equilibrium, and offer some thoughts on welfare analysis. Section 6 concludes. In the Appendix, we provide some proofs, discussion of alternative equilibria, and also a numerical example to show that— contrary to the conclusion under complete information—welfare can be improved by vertical merger. 9. Whether nondiscrimination clauses can fully solve the upstream firm’s commitment problem depends on exactly how these clauses are applied (to a whole contract or to each separate component of the contract) — see DeGraba and Postlewaite (1992), McAfee and Schwartz (1994), DeGraba (1996) and Marx and Shaffer (2001). The usefulness of, and potential difficulties in enforcing, exclusive dealing and resale price maintenance contracts is investigated in Alexander and Reiffen (1995). For brevity, in this paper we will concern ourselves with impact of vertical merger, but, subject to the caveats noted in these papers, the extension of the results to other vertical restraints should be straightforward. 10. Readers familiar with the theory may wish to skip this section. Foreclosure with Incomplete Information 513 FIGURE 1. THE STRUCTURE OF THE MARKET 2. The Complete Information Benchmark The original complete information model where lack of output commitment yields incentives for vertical merger is due to Hart and Tirole (1990). Here we adopt the simpler structure used by Rey and Tirole (2005) in their recent survey of the literature. There is one upstream firm, U, which produces a good with constant marginal cost c. U does not sell his good directly to consumers, but to downstream firms, who “process” (or package, retail, etc.) the upstream good on a one-to-one basis into an undifferentiated downstream good, which they then sell to consumers. There are two downstream firms, denoted D1 and D2 .11 We let the marginal cost of processing be constant, and without loss of generality set it to zero. Let all the fixed costs in the upstream and downstream firms be zero. Consumers have inverse demand curve for the product P(Q), where Q is the total amount offered for sale by the downstream firms D1 and D2 , whose sales are denoted q1 and q2 , respectively. Figure 1 illustrates the structure of the industry. 11. The model is easily generalized to the case of more than two downstream firms. 514 Journal of Economics & Management Strategy The timing of the game is as follows. At t = 1, U makes simultaneous take-it-or-leave-it contract offers to D1 and D2 .12 The contracts which U offers consist of tariff schedules Ti (qi ) for i = 1, 2, stating the price which firm i must pay in exchange for receiving quantity qi . Di ’s contract may refer only to the quantity, which Di will receive, and is not allowed to depend on the quantity that Dj will receive. This may be either because this information is unverifiable, or because such contracts are outlawed by antitrust authorities.13 Neither downstream firm observes the contract, which his rival is offered. However, each may form conjectures about what his rival has been offered based on the offer that he himself has received. At t = 2, given his beliefs about the likely behavior of his rival, each downstream firm i accepts or rejects U’s contract offer. If Di accepts the offer, he chooses a quantity qi from the menu offered by U and pays the corresponding tariff Ti (qi ). Notice however that because U has perfect information about the downstream firms, it can actually dictate their quantity choices, for example, by offering tariffs schedules of the form “Ti (q) = Ti if q = qi , and ∞ otherwise.” Therefore we will henceforth suppose that U makes offers of the form (Ti , qi ), where, because U makes take-it-or-leave-it offers, Ti is set so as to extract all of Di ’s expected profit. It follows that in making its offer to Di , U aims to set the preferred quantity to maximize Di ’s expected profit net of its own costs of supply, which is given by Ti − cqi = max qi P(qi + qje ) − cqi , where the superscript e denotes firm i’s expectation of firm j’s output. At t = 3, downstream firms take delivery of their orders, and transform some or all of the intermediate good received into the final downstream (consumer) good. They observe each other’s output, set their prices, and sell to consumers. Thus, this stage of the game is as in Kreps and Scheinkman (1983). The perfect Bayesian equilibrium of this game evidently depends very strongly on the conjectures made by downstream firm Di about the offer that has been made to its rival Dj (i = j). Suppose first that both downstream firms have symmetric conjectures, which is to say that suppose whatever offer Ti (.) Di receives, it believes that Dj will receive exactly the same offer, Tj (.) = Ti (.), even if Di ’s offer itself was an unexpected one. Then the fact that Di does not actually observe Dj ’s offer proves irrelevant: the outcome is the same as if such observation were possible. This is because Di will refuse to pay more than T = qi × P(2qi ) 12. For a consideration of the effect of allowing the downstream firms some bargaining power, see Chemla (2003). 13. Note that in the complete information model, antitrust authorities would be wise to prohibit such contracts, as they will allow the upstream firm to restrict output to monopoly levels. Foreclosure with Incomplete Information 515 for an offer of quantity qi . Therefore in choosing his offer to Di , U maximizes qi P(2qi ) − cqi , and so chooses an offer qi = 12 qm , where qm = argmax qP(q) − cq, is the industry monopoly quantity. Each downstream firm receives half the monopoly quantity, so it is as if the upstream monopolist could commit to selling only that amount. However, there is no very good reason to suppose that downstream firms do hold symmetric conjectures. If U makes an unexpected (i.e., out-of-equilibrium) offer to D1 , why should D1 suppose that U has made the same deviation with D2 ? The problem is that from U’s point of view, D1 and D2 constitute entirely separate markets (although of course D1 and D2 themselves perceive a strong inter-dependency). U’s aim is to maximize profits with each of D1 and D2 , and given that it has constant marginal costs, U’s optimization program with D2 is unaffected by how much he actually sells to D1 . (Observe that it only depends on D2 ’s expectation of how much D1 receives, not on how much D1 actually receives.) So if U is expected to maximize profits with D2 , there is no reason for U to change D2 ’s offer when he changes the offer he makes to D1 , and therefore no reason for D1 to suppose the contrary. For this reason, the literature has concentrated on the use of socalled “passive conjectures.”14 D1 and D2 each have prior beliefs about the quantities each of them will receive, and neither updates these beliefs during the course of the game, even if they themselves receive an unexpected offer. If downstream firms hold such beliefs, and U maximizes its profits with each of them, it should sell q1 = BR(q2e ) = argmaxq1 P(q1 + qe2 ) − cq1 to D1 and similarly q2 = BR(q1e ) to D2 , where BR(.) denotes the Cournot best response function. Imposing the condition that beliefs must be correct in equilibrium, we find that in equilibrium, U sells the “Cournot” quantity qc for a firm with marginal cost c (where qc solves qc = BR(qc )) to each of the downstream firms. Using the symbol π to denote downstream revenues, U’s total profit is therefore given by the sum of the two firms’ Cournot profits 2(π c − cqc ), which is less than the monopoly profit π m − cqm . Thus, even though U is in a monopoly position and can make take-it-or-leave-it offers to downstream firms, his inability to commit himself makes it impossible for him to extract the whole monopoly profit from the industry. This problem is sometimes called a Coasian commitment problem, by analogy with Ronald Coase’s seminal analysis 14. McAfee and Schwartz (1994) also investigate the equilibrium of a similar game under what they call wary beliefs. A firm that holds wary beliefs thinks that its rival received an offer which is a best response to its own offer. That is, it thinks that the monopolist is maximizing joint profits with its rival, given its own offer. In their game wary beliefs and passive beliefs yield the same outcome, though the conditions for the existence of an equilibrium are more stringent with wary beliefs. 516 Journal of Economics & Management Strategy of the over-selling problem of a durable goods monopolist (Coase, 1972). By contrast, if U were permitted to integrate vertically (i.e., in this context, share ex post profits) with one of the downstream firms, say D1 , U would internalize the loss of profit D1 suffers when U sells to D2 . Writing an exclusive dealing contract with D1 (in which U agrees to deal only with D1 ) serves the same purpose.15 Any of these measures allows U to foreclose the downstream market by denying D2 access to his product. Thus, foreclosure restores U’s monopoly power and results in reduced output and consumer welfare. For this reason, it has been argued that antitrust authorities should take a tough stance against attempts to foreclose markets. 3. The Incomplete Information Model We saw in the previous section that the new foreclosure doctrine relies heavily on the assumption of passive conjectures. If conjectures were symmetric, vertical integration would have no anticompetitive effects and would merely reflect efficiency considerations, as originally argued by the Chicago School (Bork, 1978; Posner, 1979). But the argument for the use of passive conjectures, although persuasive, may nevertheless leave one feeling a little uneasy. Suppose one were actually in the position of D1 receiving an unexpected offer. Wouldn’t one ask oneself: why didn’t U make the offer that seemed to maximize his profits with me? In the complete information model, there can be no answer to this question: U took a probability zero action. But in reality, one might think that what one thought was U’s best offer turned out not to be U’s best offer— perhaps his costs were higher than expected, for example. This suggests that an incomplete information model might be a better tool to capture the kind of phenomena which the new foreclosure literature analyzes.16 This section introduces incomplete information into the model. For simplicity and ease of comparison, the model we use is exactly as before (Section 2) with one addition. We allow c, U’s marginal cost, to be 15. Note that the direction of the exclusive dealing contract is the opposite to that usually studied in the literature: U deals exclusively with D1 , though D1 does not necessarily deal exclusively with U. It has been argued that exclusive territories can play the same role in geographically differentiated markets (McAfee–Schwartz 1994). For a model incorporating exclusive dealing between a downstream monopolist and one of two upstream rivals (i.e., the reverse set-up to that considered here) see O’Brien and Shaffer (1997). 16. Interestingly, in their experimental study of foreclosure, Martin et al (2001) find that downstream participants’ beliefs seem to be something of a mixture of passive and symmetric conjectures. This will be reminiscent of our result that beliefs will adjust if the cut in output is large enough and will be passive otherwise. Foreclosure with Incomplete Information 517 initially unknown. At t = 0 in the timing given above, Nature makes a move determining U’s type, which is revealed privately to U. There are two possible types of upstream firms, high cost and low cost, denoted UH and UL , respectively. We denote their respective utility functions by UUH and UUL . Downstream firms D1 and D2 (about whom there remains complete information) have utility functions UD1 and UD2 , respectively. Nature selects a high-cost upstream firm with probability α, and a lowcost upstream firm with complementary probability 1–α. We denote D1 ’s e beliefs about what will be sold to D2 by q2H if he is facing a high type and e q2L for a low type. The actual amounts sold are denoted q2H and q2L , respectively. An analogous notation is used for the quantities sold to D1 e (q1H and q1L ), and for D2 ’s expectations about these quantities (q1H and e q1L ). As in the previous section, downstream profit (gross of payments to the upstream firm) is denoted by π(.,.). As in the previous section, we will not model the integration decision explicitly; we will simply treat the case where downstream firms are unintegrated and assume that an integrated vertical structure can achieve the vertical monopoly profit by pricing accordingly (because in that case there is no commitment problem). But in order for this approach to be valid, we now need to assume that if firms integrate (or sign exclusive dealing contracts) then they must do it at a stage t = −1, before upstream costs are known. Otherwise there might be information contained in the integration or exclusive dealing offers, which would complicate our analysis. Although in this model (unintegrated) D1 does not care directly about U’s cost, U’s cost affects the offer he can be expected to make to D2 , so D1 ’s utility is indirectly dependent on U’s type. Therefore U will wish to signal his type through his offer to D1 . This means that we have a signaling game with a potential multiplicity of equilibria. Some of these are very inefficient—for example, they may involve both types of upstream firm producing more than the best-response (Cournot) output or less than the high-cost monopoly output (in each case, firms “pool” for fear of being thought to be low-cost for certain). But such equilibria are plainly implausible, and can be sustained only if D1 “threatens” to “punish” deviations from equilibrium behavior by adopting particularly unfavorable beliefs about deviating firms’ types. In fact, the problem of multiplicity of equilibria is in some sense even worse than in standard signaling games, because it is compounded by the multiplicity of equilibria in the complete information game— described in the previous section—which results from the relative freedom in choosing a firm’s beliefs about the upstream firm’s sales to its rivals when it receives an off-the-equilibrium path offer. Sometimes 518 Journal of Economics & Management Strategy introducing a little uncertainty into a complete information game does serve to reduce the set of equilibria (e.g., in the supply function literature, Klemperer and Meyer (1989)) but that is not the case for the particular model studied here. Intuitively, the two forms of multiplicity are “independent” of one another in that even if one solves the multiplicity of equilibria in the signaling game by a suitable choice of refinement, one is still free, given an assignment of type, to assign beliefs about what that type will offer to the rival downstream firm in the event of an out-of-equilibrium offer; and conversely, that even if one decides that “passive beliefs” are appropriate in the complete information model, if different types act differently, one still has to solve the signaling game multiplicity problem to assign a type (and hence an offer to the rival) in the event of an out-of-equilibrium action. Thus our approach will essentially be to tackle these two multiplicity problems separately. With respect to the multiplicity problem that arises due to conjectures even in the complete information model, we will adopt the “market-by-market” approach described in the previous section, which, when applied to the complete information problem, implies passive conjectures. We do this partly because we think that, because upstream profits do not depend on sales to downstream rivals, it is reasonable to assume that upstream offers do not depend on this either. But the main point is that if we wish to investigate how the introduction of incomplete information affects the previous complete information literature, it makes sense to adopt the same approach as that literature in modeling the way that off-the-equilibrium-path conjectures are formed. Note, however, that (as the opening paragraph of this section suggests), and as we will show in Section 4 below, the adoption of the market-by-market approach with incomplete information will not imply beliefs that are entirely passive, but rather beliefs that will be updated only if the offered output is sufficiently low. The adoption of the market-by-market approach allows us to proceed in the following discussion by taking the outcomes of UL and UH ’s interactions with D2 as given and concentrating entirely on U’s interaction with D1 . In Section 5, we will close the model by arguing that a similar interaction occurs between D2 and the upstream firm on the “other side” of the market, and considering what this implies for the market as a whole. Before that, in the remainder of this section, we tackle the problem of the multiplicity that arises from the incompleteness of information about U’s type. Perhaps the most elegant way of handling the problem of multiplicity in signaling games is to use the framework suggested by Maskin and Tirole (1992). Rather than introducing an equilibrium refinement, they instead argue that traditional signaling models do not exhaust the Foreclosure with Incomplete Information 519 contracting opportunities open to the informed principal (i.e., in our context, the upstream firm), because they assume that the informed party offers a single contract to the uninformed party. Then, if the informed party offers an unexpected contract, the uninformed party must form beliefs about the informed party’s type, and the flexibility the modeler has in assigning these beliefs can be used to support a large set of equilibria. Maskin and Tirole show that if, instead, the informed party is allowed to offer an incentive-compatible menu of contracts, this multiplicity of equilibria will be much reduced, sometimes (depending on parameter values) to a unique equilibrium. In the working paper version of this paper (White, 2003), we show in detail how this method can be used to restrict the set of equilibria in our game.17 Here we spare the reader the technical details and merely provide an informal discussion and intuition for the result. The way that the Maskin–Tirole mechanism works in the context of this model is as follows. Because U has two possible types, U offers each downstream firm D1 a pair of contracts (TL , qL ), (TH , qH ) from which U will choose after D1 has accepted. If the pair of contracts is incentive compatible for U, then D1 knows which type will choose which contract ex post, so there is no room for D1 to inflict punishment by beliefs. Moreover, if D1 breaks even on each of the possible contracts in the menu, given the type that will choose that contract, then D1 does not care which type he faces. Consider maximizing each upstream firm type’s utility from his own choice from such a menu subject to it being profitable for D1 type-by-type: the resulting menu can be called the low information intensity (LII) optimum for that type because (given that the allocation is such that the downstream firms will break even with each type of upstream firm separately) they have little incentive to gather information about the upstream firm’s type. Clearly each upstream type can guarantee itself at least as much utility as it gets in its LII optimum (simply by offering its LII menu to D1 ), so this provides a lower bound to each upstream type’s equilibrium utility. Now combine each type’s own choice from its LII optimum to form the low information intensity (LII) allocation, which, by construction, yields the same utility for each type and is also incentive compatible. What does the low information intensity allocation look like? It turns out to be none other than the familiar Rothschild–Stiglitz– Wilson or least-cost separating equilibrium. The reasoning relies on the observations that (a) the low-cost (“bad”) type can always do at least as well under asymmetric as under symmetric information 17. See also Segal and Whinston (2003) for another recent application of the Maskin– Tirole framework., and Tirole (2006) for a detailed textbook exposition of the method. 520 Journal of Economics & Management Strategy (b) if a downstream firm were to accept the low-cost type’s optimal symmetric information contract and yet discover that he was in fact facing the high-cost (“good”) type, he would not lose money, and (c) the high-cost (“good”) type can do at least as well when the lowcost type chooses his preferred contract under symmetric information as the high-cost type does when he offers his LII optimum, because the low-cost type’s incentive constraint may bind more tightly in the latter case. The final step in the argument is to note that if this low information intensity allocation is interim efficient, then no type can do any better than this allocation; and, because we have just argued that each type can do at least as well, the low information intensity allocation, or equivalently, the least-cost separating equilibrium, is the unique equilibrium of the game where upstream firms offer menus of contracts. One can show that, provided that the probability α of the high type occurring is less than some critical value α ∗ > 0, then the LII is indeed interim efficient, and the least- cost separating (LCS) allocation is thus the unique equilibrium. The derivation of α ∗ is given in Appendix A. Numerical calculations suggest that, though strictly positive, α ∗ can be quite small (see Appendix C). If instead this condition is not satisfied (i.e., if α > α ∗ ) then the separating allocation is still a perfect Bayesian equilibrium, but it is no longer unique. The set of equilibrium payoffs is then the set of incentive-compatible, profitable-in-expectation allocations that (weakly) Pareto dominate the low information intensity allocation.18 These “high information intensity” equilibria are discussed in Appendix A; here in the text, we will focus on the low information intensity optimum. There are several justifications for this. 18. To see this, note that when there exist allocations which Pareto dominate the LCS allocation it may be possible for the upstream firm to make a move which will give him strictly more than his low information intensity optimum. Specifically, when the probability of encountering the high (“good”) type is large enough, it may be more profitable to offer a high information intensity contract, where the downstream firm breaks even not with each upstream type individually, but “on average” across types. Because this allows the pooling of two participation constraints, it may increase the value of the optimization program. Any incentive-compatible, profitable-in-expectation allocation which Pareto dominates the low information intensity allocation can be sustained as an equilibrium, by simply assuming that if the upstream firm offers the option contract corresponding to the desired equilibrium then D1 does not update his beliefs from (α,1−α); in which case D1 ’s best response is to accept the offered contract. Offering the contract is in turn the best move for the upstream firm if D1 would otherwise “punish him by beliefs” by putting probability zero on the high-type if he offered any contract Pareto-dominating the expected contract. (Offering a contract other than the expected one is a probability zero event, so, given that our equilibrium concept is perfect Bayesian, we can apply any off-theequilibrium-path beliefs here.) Clearly the Pareto-dominated low information intensity allocation can be sustained as an equilibrium in the same way. Foreclosure with Incomplete Information 521 Firstly, using further equilibrium refinements, such as Cho and Kreps’ (1987) “intuitive criterion” in the familiar fashion also yields the least-cost separating allocation as the unique equilibrium. This is because in any equilibrium which is not profitable type-by-type, the high-cost type can find a deviation that, if it leads the downstream firm to believe that he is high cost, makes the high-cost firm better off but the low-cost firm worse off. The intuitive criterion suggests that in such a case, the downstream firm ought indeed to update his beliefs in such a way that he places zero probability on the type that would not gain from making this deviation—and this breaks the profitable in expectation high information intensity equilibria. Secondly, if the downstream firms can find out (at some cost to themselves) the upstream firm’s marginal cost, then, as their name suggests, the high information intensity equilibria may not be stable. The reason is that downstream firms make a loss with the low-cost types and a profit with the high-cost types, so they will have a strong incentive to find out which type of firm has offered them a high information intensity contract. If they do so, then they will accept such a contract only when they face a high type, which will upset the equilibrium. 4. Properties of the Least-Cost Separating Equilibrium In the least-cost separating (or low information intensity) equilibrium, the low-cost type receives his symmetric information payoff. The intuition behind this is that, because in a separating equilibrium, his type will anyway be revealed (as the worst possible), the low-cost firm might as well maximize profits rather than distort output in a failed attempt at signaling his type.19 As we saw in Section 2 above, under symmetric information about costs, the low-cost type simply offers a contract, which is the best response to what D2 is expected to receive, and a tariff to extract all the expected profit. Thus, the Coasian situation of the low-cost firm is unaltered by the introduction of incomplete information. By contrast, (if the no-mimicking constraint in the separating programme is binding) the high-cost type does not maximize his profit with D1 given the expectation of q2H , so the Coasian situation of the highcost type is altered. Let us assume for the moment that this constraint does bind. (We will show below under what conditions this will be the 19. For readers thinking in terms of the Cho–Kreps equilibrium selection device, this property comes from the deletion of dominated strategies. Strategies where the low-cost type D1 ’s type are revealed and yet he does not produce his best-response output are dominated by producing his best-response output and having his type revealed. 522 Journal of Economics & Management Strategy FIGURE 2. EQUILIBRIUM DISTORTION IN UH ’S OUTPUT TO AVOID IMITATION BY UL case in equilibrium.) If UH simply aimed to maximize joint profits with D1 by producing his best-response quantity, the low-cost type would e imitate him, and so D1 could not be sure that q2H was indeed the correct expectation. So UH has a commitment not to sell “too much” to D1 , because if he did then D1 ’s beliefs would change adversely. UH must distort his output downwards compared to his best response.20 The distortion is just enough that the low-cost type is indifferent between his best response to his own expected sales to D2 on the one hand; and the advantage of having D1 believe that his rival will receive only e q2H combined with the disadvantage of being able to selling only the high type’s distorted output on the other.21 (See Figure 2.) 20. This must be true in any separating equilibrium, not only in the least-cost separating equilibrium that we have selected. In this sense our results are robust to the equilibrium refinement. I thank Paolo Garella for this remark. 21. Again, those familiar with the Cho–Kreps framework will recognise that this property arises from the test of equilibrium domination. This ensures separation rather Foreclosure with Incomplete Information 523 Notice that the introduction of incomplete information to the model has allowed us to partially endogenize the passivity of conjectures. Although one can make an argument in favor of passive conjectures, the choice between passive and symmetric conjectures in the complete information model is necessarily somewhat arbitrary. In the incomplete information model, this defect is partly remedied because each downstream firm will sometimes update his beliefs about the offer received by his rival, depending on the offer that he himself receives.22 That is to say, the updating of beliefs is regulated by equilibrium refinements. If the offer which a downstream firm receives is sufficiently small that a low-cost firm would never want to make such an offer (even if he were thereby believed to be high cost), the downstream firm will indeed update his beliefs to suppose that he is facing a highcost firm, and that the quantity his rival receives will also be small. If the quantity offer is not so small, then the downstream firm’s conjectures will be genuinely passive: he will not be led to believe that he is facing a high-cost firm.23 5. Results To close the model, we need to consider what happens in the contract game between U and D2 . Clearly if there is separating on one side of the market, then there is an incentive to signal on the other side of the market.24 So this game must also be solved for D2 on the other side e e and q1L . Evidently the of the market, given arbitrary conjectures q1H than pooling, as low-cost type would never want to produce less than this even if he were thereby believed to be a high-cost type; thus the high-cost type can deviate to this output and make a speech to this effect. Lower outputs than this are dominated for the high-cost type. 22. Note that we retain the “market-by-market” assumption that U offers contracts separately and unobservably to D1 and D2 . This assumption was used to justify passive conjectures in Section 2. In a complete information framework, this argument is reasonable in that U’s maximization problem with each downstream firm is independent of how large a quantity he actually sells to the other downstream firm. But in an incomplete information context, it can no longer serve to justify passive conjectures per se, because the quantities sold in each market are nevertheless correlated because of their mutual dependence on upstream costs. Therefore some quantity offers to D1 will lead him to update his beliefs about the quantity offered to D2 (and vice versa). 23. Note, however, that the remedy is only partial. If downstream firms were to hold completely symmetric beliefs, then the equilibrium output would be the monopoly output as in the complete information model. This is because the upstream firm’s type then becomes irrelevant information from D1 ’s point of view (he already “knows” sales to D2 ) so no signaling occurs. But we would argue (see Section 3 and the previous footnote) that symmetric beliefs seem less palatable—and “market by market” beliefs less stark—in an incomplete information context. 24. Thus pooling (i.e., both cost types producing the same quantity) on only one side of the market is not an equilibrium. Pooling on both sides of the market is not an equilibrium either because then quantity offers on one side, say D1 , do not signal anything about 524 Journal of Economics & Management Strategy case will be analogous: the low-cost type will sell D2 his best response to the expected output of D1 , and the high-cost type will reduce his output to just enough that the low-cost type does not mimic. Finally, to complete the equilibrium, we need to check that beliefs are correct e e e in equilibrium, that is, q1H = q1H and q2H = q2H , q1L = q1L and q2L = e SI e q2L . Given that we saw above that q1L = q1L = BR(q2L | cL ) (where the superscript SI denotes symmetric information), and the same will be true for q2L , this means that q1L and q2L will be best responses to one another, so the low-cost type will produce his Cournot output qLC on each side of the market. For the high-cost type, the interesting case arises when his output is distorted away from mutual best responses (highcost Cournot output qHC ) by signaling considerations. Therefore we assume A1: Costs are sufficiently close that the low-cost type’s incentive comC C C patibility constraint (IC : UL →UH ) binds: π ( q H , qH ) − cL q H ≥ π (q LC , q LC ) C − cL q L . A sufficient condition for this to occur is that the high-cost type’s Cournot output is greater than the low-cost type’s monopoly output. Then, if the H-type produces best responses to his own output, the L-type can certainly increase profit by mimicking him, because by doing so, the L-type moves closer to his own monopoly output. When A1 holds, the high-cost type’s output qSE (SE for separating equilibrium) is distorted below his best response to his own output on each side, and is defined by π(q SE , q SE ) − c L q SE = π q LC , q LC − c L q LC . Under A1, it is easy to show that the high type produces less under incomplete than under complete information: Proposition 1: In equilibrium, the low-cost type produces his type’s Cournot output, and the high-cost type produces less than his own Cournot output. This proposition follows easily from A1, as the high-cost firm’s output must be greater than his Cournot output in order to successfully separate from the low-cost firm. Notice however that the signaling that occurs in this model has a rather strange feature. It is costly on each side of the market individually, because output is distorted away from the best response. But overall, the need to signal is actually beneficial quantity offers on the other side D2 . So firms do best by producing their best responses to D2 ’s output, which differ according to cost. Foreclosure with Incomplete Information 525 to the high-cost type because it allows him to commit to low output. In particular, under the assumption of a quadratic profit function (i.e., linear demand), we can prove that: Proposition 2: The low-cost type is just as well off, and the high-cost type is strictly better off, under incomplete information compared to complete information about costs. Proof. See Appendix This result implies that the upstream firm will have little ex ante incentive to make his costs transparent to downstream firms. He anticipates greater profits in a situation where his costs are private information. This theoretically validates our investigation of incomplete information about upstream costs. The result demonstrated in proposition 2—that the high-cost firm will be better off under incomplete information—would be obvious if 2 × qSE could be shown to be always above the high-cost type’s monopoly output, qM H . However, this need not necessarily be the case (see the proof of the proposition above and the numerical example in the appendix). It is clear that the low-type will always want to imitate the high-type if the latter’s total production is more than qM L . If the costs of the two types are very close, then their monopoly outputs will also be close, so that the low-type would still want to imitate the high-type if the latter were believed to be selling 12 qM H to Di . For the incentive compatibility constraint to hold in this case, q1H + q2H will have to fall below the monopoly output qM H . Thus we have: Proposition 3: If costs are sufficiently close, high-cost output in the signaling game will be less than high-cost monopoly output. By contrast, if the high-cost U is allowed to take measures to ‘foreclose’ D2 from the market, his production will be qM H . This is because the need to signal costs (and so, indirectly, sales to rivals) disappears when downstream firms no longer fear expropriation.25 For example, if D1 is vertically integrated with U, D1 knows that U will internalize the effect on D1 ’s profits if he sells to D2 . Similarly, if U signs a contract to deal exclusively with D1 , D1 does not care about the level of U’s costs because he knows that U will not sell to D2 . Thus we have the following corollary: Corollary: When costs are sufficiently close, output, profits, and welfare will improve when the high-cost type vertically integrates, or takes other 25. As noted above, we are assuming that the decision to vertically integrate or sign an exclusive dealing contract must be taken before costs are known so that the negotiations over integration or exclusive dealing do not themselves reveal information. 526 Journal of Economics & Management Strategy measures (such as exclusive dealing), which would ordinarily foreclose the market. This result is important for three reasons. Firstly, as noted in the introduction, it brings the foreclosure literature closer into line with the available empirical evidence that downstream prices tend to be higher on average with vertical separation. Our result shows that this fact is not necessarily inconsistent with the existence of Coasian commitment problems if there is incomplete information. Secondly, it is interesting to note as an aside that the model gives no support to the intuitive idea that the need to avoid opportunism could give rise to contractual rigidity. McAfee and Schwartz (1994) suggest that the lack of variation among franchise contracts might be attributable to upstream firms’ need to signal to nonintegrated downstream firms that they are not acting opportunistically. A complete investigation of this idea is outside the scope of this paper. But consider a simple two-period version of our signaling model, where U’s cost is either H or L, independently drawn in each period. The repeated play of the equilibrium we have highlighted for the one-shot game is an equilibrium of the two-period repeated game.26 But in this game, it is easy to see that the need to signal that there is no opportunism tends to result in greater rather than less variation in contractual terms across periods. If the upstream firm could contractually commit to D1 not to sell opportunistically to D2 , then D1 ’s one-period contract would be either qLM or qH M according to U’s cost realization in that period. This compares to sales of either 2qLC (more than qLM ) or 2qSE (which could easily be less than qH M ) when no such commitment is possible. Thus the need to signal no opportunism induces a greater variation in contracts as conditions vary. Thirdly, and most importantly, let us return to the policy implications of the result. It would be desirable for antitrust authorities to allow (indeed, encourage) measures, such as vertical integration and exclusive dealing by high-cost types, at least in situations where the separating output is below the monopoly output. By contrast, foreclosure by the low-cost type should be prevented for the same reasons as under complete information. This is the ideal policy, but the informational requirements for implementing it may be formidable. However, it may be that antitrust authorities are often in a better position to demand cost audits of upstream firms than are downstream firms. 26. I believe that this is the only perfect Bayesian equilibrium given the finite horizon, the market-by-market assumption, the use of contract menus à la Maskin–Tirole in each period with α < α ∗ , and appropriate restrictions on the ability to “punish by beliefs” across periods. 527 Foreclosure with Incomplete Information On the other hand, in situations where authorities are in no better position than downstream firms to determine whether upstream costs are high or low, it would be desirable to determine what happens to expected welfare under vertical separation and integration. Clearly, welfare will increase with integration if and only if the probability of the high type is large enough. In particular, the probability of the high type must be larger than the critical α ∗W defined by: q HM M qL ∗ ∗ αW [ p(q ) − c H ] dq = − 1 − αW [ p(q ) − c L ] dq q SE qC Unfortunately, it is difficult to get general results about the welfare changes from the move from vertical separation to vertical integration without making specific assumptions about the demand function. The problems are akin to those encountered in the literature on third degree price discrimination: output shrinks in one market, and expands in another, and the welfare effects depend on the form of the demand function. In our case, the two “markets” occur probabilistically: with some probability we have a low-cost firm and output shrinks, but with complementary probability we have a high-cost firm and output may rise. Rather than enter into these difficulties, we content ourselves with providing a simple numerical example in the appendix to show that welfare can increase with foreclosure, though in general the result is ambiguous. 6. Conclusions The last decade has seen a trend towards a tougher policy stance on vertical restraints by antitrust authorities.27 This trend has been associated with a more critical approach to the old Chicago School arguments in the academic literature, and the so-called new foreclosure doctrine forms a part of this. The papers expounding this doctrine show that an upstream monopolist will generally find it difficult to commit to supplying only the monopoly quantity to downstream firms, but that vertical integration or exclusive dealing helps him to do so. Therefore vertical integration or exclusive dealing are profitable but reduce output and social welfare. In this paper, we have shown that this conclusion is not robust to the introduction of incomplete information about the monopolist’s cost. When downstream firms do not know the upstream firm’s cost, a high-cost upstream firm has a strong incentive to signal to them that his cost is high (and consequently that sales to their rivals will 27. See, for example, Klass and Salinger (1995) and more recently, the survey by Riordan (2005). 528 Journal of Economics & Management Strategy be low). To credibly signal his type, the high-cost firm may have to cut his output below his desired monopoly output. Thus the signaling effect on the unintegrated high-cost firm’s output can easily offset his Coasian tendencies to oversupply. When that happens, allowing the high-cost firm to vertically integrate or sign an exclusive dealing contract increases output, profits and welfare, because the need to signal disappears along with the fear of opportunism. By focusing exclusively on complete information models, the reduction in output associated with signaling under vertical separation have been overlooked, leading to one-sided policy conclusions. We argue for a more balanced attitude on the part of competition authorities, because foreclosure will be damaging for low-cost firms but may be Pareto improving for high-cost firms. Appendix A: The High Information Intensity Equilibrium One reason why foreclosure might be generally bad for welfare is that before the monopoly output is reached, the distortion away from his best response in the LII equilibrium becomes so costly that UH prefers to offer a contract that is profitable only in expectation, and not typeby-type. Such a contract is known as a high information intensity contract because of the fact that downstream firms make losses on some cost-types means that their payoff is sensitive information about the upstream firm’s cost type. The reasons why such an equilibrium might not survive equilibrium refinements or investigation of upstream costs by downstream firms are discussed in the text. Here we nevertheless investigate such equilibria, since as we will see, they allow the highcost firm to expand his output relative to the low information intensity equilibrium and so (if they occur) lead to the conclusion that foreclosure will be more harmful than in LII equilibria. To characterize the high information intensity equilibria that might arise when α > α ∗ , we proceed in the following way. First we find the L(R) function, which is the minimum loss, which D1 must bear if the low-cost firm is to get a rent R above what he receives in the separating equilibrium. Setting up and solving this program, e −L(R) = max(over T1L , q 1L ) : UD1 = max − T1L + π q 1L , q 2L e s.t. T1L − c L q 1L ≥ R + π1L q 1SI , q 2L − c L q 1SI we find, not very surprisingly, that the most efficient way to give the low-cost firm a rent R is not to distort his quantity choice away from e BR(q2L | cL ), but simply to increase the tariff TL by this amount R. This means that L(R) = R. Foreclosure with Incomplete Information 529 Now the high-cost type considers maximizing his payoff when he has more slack on UL ’s IC and less on D1 ’s IR than in the LII program. Program HII: Max(over T1H , q 1H , R): UU H = T1H − c H q 1H e − T1H − (1 − α)L(R) ≥ 0 s.t. α π q 1H , q 2H e & UL (T1H , q 1H ) = T1H − c L q 1H ≤ R + π1L q 1SI , q 2L − c L q 1SI Setting up the Lagrangean with multipliers λ and µ on the first and second constraints respectively, the expression for q1H is then given by Max (over T1H , q 1H , R): e L = T1H − c H q 1H + λ α π (q 1H , q 2H − T1H ) − (1 − α)R e + µ T1H − c L q 1H − R − π1L q 1SI , q 2L + c L q 1SI + ρ R, ∂ L/∂ T1H = 1 − λα + µ = 0, (A1) ∂ L/∂ R1H = −λ(1 − α) − µ + ρ = 0, (A2) e ∂ L/∂q 1H = −c H + λαπ q 1H , q 2H − µc L = 0, (A3) e ∂ L/∂λ = α π q 1H , q 2H − T1H − (1 − α)R = 0, (A4) e ∂ L/∂µ = T1H − c L q 1H − R − π1L q 1SI , q 2L + c L q 1SI = 0, (A5) ∂ L/∂ρ = R = 0 (A6) or ρ = 0. Equations (A1) and (A2) imply that if pooling is profitable (i.e., R > 0, ρ = 0) − λ(1 − α) = −(1 − λα) →λ = 1 and µ = −(1 − α). Substituting into (A3) e ∂ L/∂q 1H = −c H + απ q 1H , q 2H (A7) + (1 − α)c L = 0, which yields the optimal “pooling” output as the implicit solution to e α π (q 1H , q 2H − c H − (1 − α)(c H − c L ) = 0. (A8) e Thus for α < 1, output is strictly below the best response to q2H , with the size of the distortion decreasing in α. 530 Journal of Economics & Management Strategy One can find the critical α at which pooling is just profitable by making the following line of reasoning. At the separating output qSE , imitation by the low-cost type may have forced the high-cost type to produce qSE such that π (qSE , qSE ) − cH > 0. In this case, output could profitably be expanded if it were not for imitation by the low-cost type. But imitation by the low-cost type can be avoided by giving the low-cost type more rent R on his own contract. So consider the following strategy for the high type: increase output by ε units, and increase the tariff T1H by εcH units, so that the change is just profitable. The downstream firm D1 can now expect to make a profit of approximately ε(π (qSE , qSE ) − cH ) units when he faces the high-cost type, which happens with probability α. This means that he will be willing to accept a loss of α/(1 − α) × ε (π (qSE , qSE ) − cH ) on the low-cost type, which equals the extra rent that can be given to the low-cost type to encourage him not to imitate. This extra rent will be just sufficient if it covers the rent that the low-cost type could earn by copying the high-cost type’s increase in output ε(cH − cL ). Thus the critical α ∗ at which pooling occurs solves α ∗ = Min α such that α/(1 − α) × (π (q SE , q SE ) − c H ) ≥ (c H − c L ), That is, α ∗ = (c H − c L )/(π (q SE , q SE ) − c L ). If the two types have strictly different marginal costs, and assumption A1 holds (so the high type’s LII allocation is distorted) then α ∗ > 0. This establishes Lemma 3 in the text. Another way to reach the same conclusion is to use the implicit expression for pooling output equation (A8) above. When the constraint R ≥ 0 just ceases to bind, the optimal pooling and separating outputs coincide. Substituting π (qSE , qSE ) as the derivative of downstream revenue in equation (A8) therefore gives the same expression for the critical pooling α∗. Appendix B: Proof of Proposition 4 Assume that the total revenue function TR(Q) (sum of revenues from sales to D1 and D2 ) is single-peaked and symmetric. The argument will proceed by using symmetry to find the derivative of TR(.) at the separating output, and from there to showing that high-cost profits must C be greater at qSE than at qH . Step 1: The derivative of TR(.) at qLC . dR1 dR2 d (TR(q 1 , q 2 )) = + dq 1 dq 1 dq 1 531 Foreclosure with Incomplete Information At q LC , dp d R1 = p + q1 = c, dq 1 dq 1 d R2 dp q2. = q2 · = dq 1 dq 1 q1 d R1 −p . dq 1 Now, in equilibrium, q1 = q2 = q. So consider the derivative of profit when changing both outputs at the same time: d 1 ∂ 1 ∂ 1 [ /2T R(2q ) − cq ] = [ /2T R(2q )] dq 1 + [ /2T R(2q )] dq 2 − c dq ∂q 1 ∂q 2 d R1 = 2×1/2 2 − p − c at q LC dq = 2c − p − c = c − p. Step 2: The derivative of TR(.) at qSE . If the profit function is symmetric, then because the level of profit at qSE is the same, the derivative must be of equal magnitude and opposite sign: d 1 [ /2T R(2q ) − cq ]|q =q SE = −c L + p q LC . dq Step 3: The derivative of high-cost profits. Now consider the high-cost firm. Its profits when q1 = q2 = q are − cH q. So the derivative of high-cost profits at any q are given 1 TR(2q) 2 by d 1 d 1 /2TR(2q ) − c H = /2TR(2q ) − c L + (c L − c H ) dq dq = derivative of UL ’s profits + (c L − c H ) Thus the derivative of high-cost profits at qSE are − cL + p(qCL ) + (cL − cH ) = p(qLC ) − cH . By A1, qSE is to the left of qHC . If the profit function is downward-sloping at qSE , (this is the case if p(qLC ) − cH < 0), we are still to the left of the maximum, so qSE must yield more profit than qCH . So suppose instead we are on an upward sloping part (p(qLC ) − cH > 0). Because profit functions are symmetric, this will represent higher profits as long as the function is flatter at qSE than at qHC (because this implies that we are closer to the optimum). This is true if the sum of the gradients is negative. By reasoning analogous to that above, the gradient at qHC must be cH −p(qHC ). The sum is thus p(qLC ) − cH + cH − p(qHC ), which is indeed always negative, because the price is higher with high-cost Cournot output than low-cost Cournot output. 532 Journal of Economics & Management Strategy Appendix C: A Simple Numerical Example of Welfare-Enhancing Vertical Integration In this section we provide a simple numerical example to show that the output of the high-cost firm can easily fall below his own monopoly output and that average welfare can therefore increase with foreclosure. Suppose we have: Linear Demand : p = a − b(q 1 + q 2 ), with a = b = 1 Constant marginal costs c L = 0.1, c H = 0.2. We saw above that the low type always produces his Cournot output.28 Here, Cournot output for the low-type is q1L = 0.3 and the low-type’s profit is 0.09 (=T1L − cL q1L ) from D1 and similarly from D2 . This is the low-type’s symmetric information payoff. Thus the least-cost separating program for the high-cost type simplifies to Max(over T1H , q 1H ): T1H − c H q 1H (I C : UL → UH ) UU L (T1L , q 1L ) = 0.09 ≥ T1H − c L q 1H = UU L (T1H , q 1H ) e (IR: D1 ; H) ≥ 0. UD1 (T1H , q 1H ) = −T1H + π q 1H , q 2H Supposing that both constraints bind, we have T1H = π (q1H , qe2H ), and 0.09 = π(q1H , qe2H ) − cL q1H = (a − b(q1H + q2H ))q1H − cL q1H . Letting qe2H = 0.15 gives q1H also equal to 0.15, so this is clearly an equilibrium, because the high-type is maximizing in both markets subject to not being imitated by the low-type.29 So, q1H = q2H = 0.15 is the high-type’s separating (LII) allocation. Notice that the monopoly output for the high-cost type is 0.4 > 2 × 0.15, so the high-cost type is producing strictly less than his monopoly output in order to separate himself from the low-cost type. Clearly then, output and welfare improve when the high-cost type is allowed to take measures (e.g., vertical integration, exclusive dealing, and nondiscrimination clauses) which would allow him to achieve monopoly profits and output. 28. Actually this continues to be true in the high information intensity equilibrium, as we show in Appendix A. 29. Could there be an HII asymmetric equilibrium? The answer is no, because (the symmetry of the downstream markets means that) in that instance there would always be some slack on one of the constraints, so the high-cost type could do better by changing his output. For example, if qe2H were lower, say 0.14, q1H would have to fall to 0.147 in order to be incentive compatible for the low-type with D1 . Then considering the contract with D2 , we would have: π (0.147, 0.14) − cL q2H = (1 − (0.147 + 0.14))(0.14) − (0.1)(0.14) = 0.08582 < 0.09 Thus the incentive constraint in D2 ’s market is slack. So U should increase output in this market. Anticipating this, D1 should not expect qe2H = 0.14. Foreclosure with Incomplete Information 533 Of course, if the low-cost type is allowed to take such measures, welfare will fall. In case the antitrust authorities are unable to determine whether the upstream firm has output below monopoly levels due to signaling (i.e., high-cost) or above monopoly levels due to lack of commitment (i.e., low-cost), it may be interesting to look at what happens to expected welfare under vertical separation and integration: Under Vertical Separation we have L-type LII profit = 0.09 × 2 = 0.18; L-type consumer surplus = 0.18 Consumer + Producer surplus with low-cost type: 0.36 H-type LII profit = 0.075 × 2 = 0.15; H-type consumer surplus = 0.045 Consumer + Producer surplus with high-cost type: 0.195. Whereas if we had the monopoly output (e.g. through vertical integration), L-type monopoly profit = 0.2025; L-type consumer surplus = 0.10125 Consumer + Producer surplus with low-cost monopoly: 0.30375 H-type monopoly profit = 0.16; H-type consumer surplus = 0.08 Consumer + Producer surplus with low-cost monopoly: 0.24. It is straightforward to show that for any α ≥ 5/9, expected welfare will be higher under monopoly than under separation if the LCS equilibrium is played. In particular, if α = 0.56, we have Weighted average welfare under monopoly = 0.56(0.24) + 0.44(0.30375) = 0.26805 Weighted average welfare in LCS equilibrium = 0.56(0.195)+ 0.44(0.36) = 0.2676. Note however, that in this example, the critical α ∗ below which the LCS is the unique equilibrium is only 2/11, so that the LCS is not the only possible equilibrium outcome in the absence of integration.30 30. Calculations for the high information intensity optimum for the high-cost type (see Appendix A) when α = 5/9 yield an output of 6/25 for the high-cost type (with the lowcost type continuing to produce its symmetric information output), resulting in aggregate welfare of approximately 0.41. Interestingly, in this example, the high-cost type makes higher profits in the HII allocation than in the LCS allocation, although there is no reason to suppose that this is generally the case, because the HII is applied “market-by-market.” It allows the high-cost type to profitably expand output with, say D1 , taking the profits with D2 as given; but of course in equilibrium D2 will be willing to pay less for a given output if he anticipates that U will offer the HII rather than the LCS allocation to D1 . 534 Journal of Economics & Management Strategy References Alexander, C. and D. Reiffen, 1995, “Vertical Contracts as Strategic Commitments: How Are They Enforced?” Journal of Economics and Management Strategy, 4, 623–649. Ausubel, L. and R. Deneckere, 1992, “Durable Goods Monopoly with Incomplete Information,” Review of Economic Studies, 59, 795–812. Baake, P., U. Kamecke, and H.-T. Normann, 2004, “Vertical Foreclosure versus Downstream Competition with Capital Precommitment,” International Journal of Industrial Organization, 22, 185–192. Bork, R., 1978, Anti-Trust Paradox, New York: Basic Books. Chemla, G., 2003, “Downstream Competition, Foreclosure, and Vertical Integration,” Journal of Economics and Management Strategy, 12, 261–289. Chen, Y., 2001, “On Vertical Mergers and Their Competitive Effects,” RAND Journal of Economics, 32, 667–685. Cho, I.-K. and D. Kreps, 1987, “Signaling Games and Stable Equilibria,” Quarterly Journal of Economics, 102, 179–221. Choi, J. and S.-S. Yi, 2000, “Vertical Foreclosure with the Choice of Input Specifications,” Rand Journal of Economics, 31, 717–743. Church, J. and N. Gandal, 2000, “Systems Competition, Vertical Merger and Foreclosure,” Journal of Economics and Management Strategy, 9, 25–51. Coase, R., 1972, “Durability and Monopoly,” Journal of Law and Economics, 15, 143–149. Cooper, J., L. Froeb, D. O’Brien, and M. Vita, 2005, “Vertical Antitrust Policy as a Problem of Inference,” Mimeo, Federal Trade Commission. DeGraba, P., 1996, “Most-Favored-Customer Clauses and Multilateral Contracting: When Nondiscriminiation Implies Uniformity,” Journal of Economics and Management Strategy, 5, 565–579. —— and A. Postlewaite, 1992, “Exclusivity Clauses and Best Price Policies in Input Markets,” Journal of Economics and Management Strategy, 1, 423–454. Hart, O. and J. Tirole, 1990, “Vertical Integration and Market Foreclosure,” Brookings Papers on Economic Activity, Microeconomics, 205–285. Hovenkamp, H., 2001, “Post-Chicago Antitrust: A Review and Critique,” Columbia Business Law Review, 2001, 257–337. Katz, M., 1987, “Vertical Contractual Relations,” Chapter 11 in the Handbook of Industrial Organization, Vol. 1, Amsterdam: North Holland. Klass, M. and M. Salinger, 1995, “Do New Theories of Vertical Foreclosure Provide Sound Guidance for Consent Agreements in Vertical Merger Cases?” Antitrust Bulletin, 40, 667–698. Klemperer, P. and M. Meyer, 1989, “Supply Function Equilibria in Oligopoly Under Uncertainty,” Econometrica, 57, 1243–1277. Kreps, D. and J. Scheinkman, 1983, “Quantity Pre-Commitment and Bertrand Competition Yield Cournot Outcomes,” Bell Journal of Economics, 14, 326–337. Lafontaine, F. and M. Slade, 1997, “Retail Contracting: Theory and Practice,” Journal of Industrial Economics, 45, 1–25. —— and —— 2005, “Empirical Assessment of Exclusive Contracts,” forthcoming in Paolo Buccirossi, ed., Handbook of Antitrust Economics, Cambridge, MA: MIT Press. Ma, C.-T.A., 1997, “Option Contracts and Vertical Foreclosure,” Journal of Economics and Management Strategy, 6, 725–753. Martin, S., H.-T. Normann, and C. Snyder, 2001, “Vertical Foreclosure in Experimental Markets,” RAND Journal of Economics, 32, 466–496. Marx, L. and G. Shaffer, 2001, “Opportunism and Non-Discrimination Clauses,” Mimeo, University of Rochester. Foreclosure with Incomplete Information 535 Maskin, E. and J. Tirole, 1992, “The Principal-Agent Relationship with an Informed Principal II: Common Values,” Econometrica, 60, 1–42. McAfee, R. and M. Schwartz, 1994, “Opportunism in Multilateral Vertical Contracting: Non-Discrimination, Exclusivity and Uniformity,” American Economic Review, 84, 210– 230. O’Brien, D. and G. Shaffer, 1992, “Vertical Control with Bilateral Contracts,” RAND Journal of Economics, 23, 299–308. —— and —— 1997, “Non-linear Supply Contracts, Exclusive Dealing and Equilibrium Market Foreclosure,” Journal of Economics and Management Strategy, 6, 755–785. Ordover, J., G. Saloner, and S. Salop, 1990, “Equilibrium Vertical Forclosure,” American Economic Review, 80, 127–142. Perry, M., 1987, “Vertical Integration” Chapter 4 in the Handbook of Industrial Organization Vol. 1., Amsterdam: North Holland. Posner, R., 1979, “The Chicago School of Anti-Trust Analysis,” University of Pennsylvania Law Review, 127, 925–948. Reiffen, D. and M. Vita, 1995, “Is There New Thinking on Vertical Mergers?” Anti-Trust Law Journal, 63, 917–936. Rey, P. and J. Tirole, 2005, “A Primer on Foreclosure,” Mimeo, IDEI, Toulouse, forthcoming, Handbook of Industrial Organization. —— and T. Vergé, 2004, “Bilateral Control with Vertical Contracts,” RAND Journal of Economics, 35, 728–746. Riordan, M., 1998, “Anti-Competitive Vertical Integration by a Dominant Firm,” American Economic Review, 88, 1232–1248. ——, 2005, “Competitive Effects of Vertical Integration,” Mimeo, Columbia Business School, New York, forthcoming in Paolo Buccirossi, ed., Handbook of Antitrust Economics, Cambridge, MA: MIT Press. —— and S. Salop, 1995, “Evaluating Vertical Mergers: A Post-Chicago Approach,” Antitrust Law Journal, 63, 513–568. Segal, I. and M. Whinston, 2003, “Robust Predictions for Bilateral Contracting with Externalities,” Econometrica, 71, 757–791. Spengler, J., 1950, “Vertical Integration and Anti-Trust Policy,” Journal of Political Economy, 58, 347–352. Tirole, J., 2006, “The Theory of Corporate Finance” Princeton, NJ: Princeton University Press. Whinston, M., 2003, “On the Transaction Cost Determinants of Vertical Integration,” Journal of Law, Economics, and Organization, 19, 1–23. White, L., 2003, “Foreclosure with Incomplete Information,” CEPR Discussion Paper no. 3942. Williamson, O., 1975, Markets and Hierarchies: Analysis and Antitrust Implications, New York: Free Press. —— 1985, The Economic Instititutions of Capitalism: Firms, Markets, Relational Contracting, New York: Free Press.
© Copyright 2026 Paperzz