INFERENCE FOR LARGE GAMES WITH EXCHANGEABLE PLAYERS KONRAD MENZEL† Abstract. We develop a sampling theory for games with a large number of players that are exchangeable from the econometrician’s perspective. We show that in the limit, heterogeneity can be separated into an individual and an aggregate component, and establish conditional laws of large numbers and central limit theorems given the aggregate state of the game. We then develop estimation procedures that eliminate heterogeneity in the aggregate through conditioning and invariance properties, which allow consistent estimation and inference based on information from a finite number of games as the number of players grows large. Estimation is based on the marginal distribution of individual observable characteristics and choices, where we treat the equilibrium selection rule as an unknown nuisance parameter. We illustrate the applicability of our results for aggregative games. JEL Classification: C12, C13, C14, C15 Keywords: Incomplete Models, Multiple Equilibria, Exchangeability, Large Games The analysis of strategic interactions among individuals or firms has always been a central theme in economics, and there has been a substantial effort to develop econometric tools for estimation and inference for games. However, existing methods for estimation and inference rely on the availability of data on a large number of independent realizations of a game with a given number of players. In this paper, we develop a sampling theory for settings with strategic interaction among a large number of agents, where players in the finite game represent exchangeable draws from a common distribution. Game-theoretic models pose several technical difficulties - strategic interdependence typically makes statistical objects of interest hard to compute and also induces statistical dependence among different agents’ choices. Furthermore economic theory does not make unique predictions in the presence of multiple equilibria. We show that if agents are exchangeable, it is possible to obtain conditional limiting results that allow us to address each of these concerns at least for some relevant settings. More specifically, exchangeability implies asymptotic conditional independence of players’ type-action profiles given their empirical distribution. This has three key consequences: • In the many player limit of a game, only the marginal distribution of types and actions is informative about economic parameters, whereas their joint distribution does not provide additional identifying power. Date: February 2012 - this version: August 2012. † NYU, Department of Economics, Email: [email protected]. 1 2 KONRAD MENZEL • Under many-player asymptotics we can use limit theory to derive conditional laws of large numbers and central limit theorems which can be used to derive consistency results and distribution theory for estimators and other statistics. • Since the marginal distribution of type-action characters can be estimated consistently from a random subset of players in the game, estimation and inference for incompletely observed games can be done using the same methods as for a complete sample, with only minor adjustments or corrections. Hence, in an asymptotic experiment which holds the number of games fixed but increases the number of players, in the limit exchangeability restrictions lead to a separation of heterogeneity at the aggregate and individual level, respectively. In this paper we show that individual heterogeneity can be dealt with using large-sample arguments like laws of large numbers (LLN) and central limit theorems (CLT). Aggregate heterogeneity can arise even in the many-player limit of a game e.g. from a high degree of strategic interdependence between agents’ choices or multiplicity of equilibria. Since economic theory and empirical evidence do not appear to make strong predictions regarding selection of equilibria, structural estimation of games typically has to treat the selection mechanism as an incidental parameter of the observed game. In this paper, we argue that considering asymptotic approximations to the empirical distribution of typeaction profiles for a large number of players suggests several ways of eliminating this nuisance parameter. We argue that there are a number of empirically relevant settings in which the potential nondegeneracy of the distribution of aggregate states does not preclude inference on structural payoff parameters. Instead, we consider conditional asymptotics which allows us to exploit invariance properties, and discuss approaches to eliminate nuisance parameters characterizing the aggregate state through conditioning. Furthermore, in many cases equilibrium distributions for the limiting experiment have a simpler structure than for the finite-player game, so that the limiting distributions may serve as computationally attractive approximations to the finite-player game. Our analysis focuses on discrete aggregate games with complete information. Discrete social interaction models with a large number of agents have been analyzed by Glaeser, Sacerdote, and Scheinkman (1996), Brock and Durlauf (2001), Brock and Durlauf (2007), Topa (2001), Liggett (2005) and Nakajima (2007) and others. However, this work assumes that player types are private information, or that decisions are made sequentially by agents that are not forward-looking. Our paper derives a new result on convergence of large anonymous economies where agents’ types may be common knowledge among all players. Hildenbrand (1971) showed convergence of competitive markets with random preferences to equilibrium points with respect to the GAMES WITH EXCHANGEABLE PLAYERS 3 limit distribution. Brock and Durlauf (2001) analyze Bayes perfect Nash equilibria in a discrete choice model with social interactions, where it is assumed that payoff shocks that are unobserved by the econometrician are also private information among players. In contrast, we explicitly allow for statistical dependence from strategic interaction when players have knowledge about their opponents’ types. Previous work in the theoretical literature on large games has focussed on purification and approximation properties of distributional games. Kalai (2004) shows that in large populations where types are private information, Bayes perfect Nash equilibrium is ex-post Nash in ε best responses. For dynamic games, Weintraub, Benkard, and van Roy (2008) show that oblivious equilibrium approximates a Markov-perfect dynamic equilibrium as the number of players grows large. Their work aims at exploiting the computational advantages of working with the large-player limit rather than the finite-player version of the game. In contrast, our focus is on robustness with respect to equilibrium selection, which requires that any finite-player equilibrium can be approximated by an appropriately chosen set of equilibria in the limiting game. In this sense, our analysis complements existing theoretical results on convergence of finite economies to continuous limiting games. However, it is important to point out that the limiting game will typically remain easier to solve, so that our approach is also in part motivated by computational considerations. Limiting results for large economies of interacting agents have been derived under the assumption that (a) agents are backward-looking and do not anticipate other players’ behavior (see e.g. Liggett (2005)), or (b) conditional on the common knowledge about types among agents, assuming that other preference shocks are private information (see Brock and Durlauf (2001)). Clearly, either assumption has its obvious merits especially for games with large numbers of players, however we show that it is possible to obtain comparable results for the complete information case. Limits of distributions of exchangeable network models have been considered in the literature on random graphs (see e.g. Diaconis and Janson (2008) and Bickel, Chen, and Levina (2011)), but their methods are not directly applicable to arrays that cannot be embedded in an infinitely exchangeable sequence: The dependence properties of finitely exchangeable arrays are generally different from those of infinitely exchangeable arrays, and since we are interested in characterizations of equilibria in structural models of social interactions we want to take those limits explicitly. While in general almost sure convergence results are generally difficult to obtain for triangular arrays, we show that in our setup, there are reasonable conditions under which we can obtain the row-wise stability properties needed for such a result. Our treatment also differs from Andrews (2005) in that in this paper the conditioning sigma field arises endogenously from strategic interaction among agents. In particular, we 4 KONRAD MENZEL are interested in structural inference, and hence the conditioning set is modeled explicitly in terms of economic primitives and discuss conditions for (unconditional) consistency of estimators of structural parameters. Conditional convergence results for infinitely exchangeable arrays - e.g. Blum, Chernoff, Rosenblatt, and Teicher (1958), or Andrews (2005) in econometrics - rely on de Finetti’s theorem or related ideas. Our main focus is on coupling a triangular array of sequences that are only finitely exchangeable, and to give the conditioning sigma-field a structural interpretation. Identification for mixture models in the case of (finite player) discrete games with multiple equilibria has been considered by Bajari, Hahn, Hong, and Ridder (2011). Our results on aggregative games show some parallels with Shang and Lee (2011)’s conditional analysis of the private information game with many players, and our analysis shows how to adapt their results to the complete information case which does not assume that players actions are i.i.d. conditional on observables. In the following, we are going to describe the framework we use to model social interactions, and section 3 shows how to establish the conditions for convergence from economic primitives for aggregate games with complete information. Section 4 gives generic asymptotic results for sample statistics from games with a large number of players, and section 5 concludes. 2. Setup Sampling Experiment. We consider samples that are obtained from a small number of instances m = 1, . . . , M, possibly only one single realization, of a static game (”market”), and we let nm denote the number of agents in the mth game. Each player i chooses an action si ∈ S from a set of pure actions S that is the same for all players and known to the researcher. There is heterogeneity across agents in that player i receives a public signal tmi ∈ T which we refer to as player i’s type, where individual types tmi = (x′mi , ε′mi )′ are i.i.d. draws from a common distribution Hm (t) = Hm (x, ε). Note that the distributions of types may differ across markets, and in particular we also allow for market-level shocks or covariates which often play an important role in the identification of interaction effects. We also allow the dimension of types ti to depend on market size, e.g. for matching or network formation problems it may be desirable to allow for pair-specific heterogeneity in the propensity to form a link between any two agents in the mth market. We do not assume that our data set for a given game is necessarily complete, but we assume that we observe type-action characters ymi = (smi , x′mi )′ for a random sample (with or without replacement) of Nm players in the markets m = 1, . . . , M, where the components εmi of player i’s type are not observed by the econometrician but may be known to other players. We also denote the action profile for the market by sm = (sm1 , . . . , smnm ) ∈ S nm , ′ )′ . and the type-action profile by ym = (y1′ , . . . , ym n GAMES WITH EXCHANGEABLE PLAYERS 5 Preferences and Equilibrium. For the purposes of this paper, we restrict our attention to the case in which agents choose among finitely many discrete actions, S := s(1) , . . . , s(p) so that S nm contains pnm different action profiles. For ease of exposition, we continue the exposition in terms of a given realization of the market m, and we omit the first subscript m in the following discussion whenever this does not lead to any ambiguities. Preferences are given by utility functions un : S n × T → R, uni (s) := un (s, ti ; θ) that depend on an unknown parameter θ. Since S is finite, the payoff function for player i can n ′ also be represented by a vector ũni = ũn (ti ; θ) := uni (s(1) ), . . . , uni (s(p ) ) , where pn is the dimension of the pure action space. Denote t = (t1 , . . . , tn ). We can then stack the vector of payoff functions as u = u(t; θ) := (ũn (t1 ; θ)′ , . . . , ũn (tn ; θ)′ )′ and denote the payoff space for n the game by U ⊂ Rnp . We also let Σ denote the simplex of probability distributions over Sn. Most econometric problem considered in this paper concern estimation and inference regarding θ or a proper subvector of the parameter assuming that chosen actions si , i = 1, . . . , n constitute an equilibrium according to a given solution concept for the market. More specifically, a solution concept Σ∗ is a set-valued mapping of payoff profiles to distributions of strategy profiles, Σ∗ : U ⇒ Σ where Σ∗ (u) ⊆ Σ is usually defined in terms of inequality conditions on agents’ payoffs which ensure that agents’ choices are strategically consistent with their opponents’ play. Example 2.1. (Binary Choice with Social Spillovers) Suppose there are n players, each of whom can choose from two actions, si ∈ {0, 1}, and payoffsare common o n players’ P 1 knowledge among all players and are given by ui (s, ti ; θ) = µ xi , n j6=i sj , θ + εi si for i = 1, . . . , n, where µ(·) is a known function and θ ∈ Rk is an unknown parameter and iid εi ∼ N(0, 1). As a solution concept for this game we assume that players actions are realizations of strategies that constitute a Nash equilibrium given players’ payoffs. There are several interpretations for this very stylized game, e.g. actions could be firms’ market entry decisions with firm-specific costs and Cournot competition on product market, or individual decisions in the presence of social contagion, stigma or peer effects. A private information version of this problem has been analyzed by Brock and Durlauf (2001) who analyze Bayes perfect Nash equilibria, where it is assumed that payoff shocks that are unobserved by the econometrician are also private information among players. Also, Nakajima (2007) estimates a model of peer effects in youth smoking behavior based on the stationary distribution of individual choices with adaptive behavior in which myopic agents make their choices based on past play. We now discuss how to relate a solution concept of a game to a set of distributions over players actions. First, consider a discrete game with an action set S = {s(1) , . . . , s(p) }. Then 6 KONRAD MENZEL in the complete information game, a mixed strategy for player i is a measurable map ( Tn → Σ ′ σi : t 7→ σi (t) := σi (t, s(1) ), . . . , σi (t, s(p) ) where the dependence on t may be restricted under different assumptions on players’ information. Note that the individual subscripts indicate that strategies do not have to be the same for all players of the same type, and we will maintain that level of generality throughout the paper. Using standard notation, we let σ−i = (σ1 , . . . , σi−1 , σi+1 , . . . , σn ) denote the profile of strategies played by all players except i. With a slight abuse of notation, we will denote player i’s expected payoff from choosing action si against the profile σ−i by P Q ∗ ui (si , σ−i ) = s−i ∈S n−1 ui (si , s−i ) j6=i σj (sj ). Then σ ∗ = σm is a Nash equilibrium of the ∗ ∗ ′ ∗ ′ game if σi (s) > 0 implies that ui (s, σ−i ) ≥ ui (s , σ−i ) for all s ∈ S, and we denote the set of the equilibrium strategy profiles by Σ∗m (t) = Σ∗m (um (t; θ)). We can now parameterize the set of distributions over player types and actions that are compatible with the equilibrium set with an equilibrium selection rule λ. A (possibly stochastic) equilibrium selection rule λ is a mapping from type-profiles to distributions over the equilibrium strategy profiles σ ∗ ∈ Σ∗m (um (t; θ)). The parameter space for λm consists of the mappings of types to the set of mixed Nash equilibria of the game Λm := {λm : Θ × T nm → Σm |λm (σm ; θ, t) > 0 only if σm ∈ Σ∗m (um (t; θ))} From the econometrician’s perspective we will treat the equilibrium selection rule λm in the mth market as a random variable in a cross-section of one or several observed markets. Some of our asymptotic arguments in the next sections require measurability of equilibrium outcomes along a particular filtration, so that the equilibrium selection mechanism has to be defined on a common probability space along sequences of games. However, in general this does not necessarily lead to a restriction of outcome distributions for the game at a given number of players. Note that under these assumptions, observed outcomes are statistically independent across markets, and the likelihood of ym = (y1 , . . . , ynm ) can be written as Z Y n X fm (y|θ, λ) = λ(σ ∗ )σi∗ (t, si )Hm (dt) ∗ ∗ Tn i=1 σ ∈Σ (t) where Hm (t) is the marginal distribution of ti . The resulting family of distributions of y is indexed by two parameters, the parameter of interest θ ∈ Θ which will be taken to be a k-dimensional vector, and we also parameterize the model incompleteness with λ ∈ Λ(θ) a stochastic equilibrium selection rule. Hence for any fixed value of λ we have a fully parametric GAMES WITH EXCHANGEABLE PLAYERS 7 likelihood, i.e. there is a set of distributions that is indexed with (θ, λ), y ∼ fY1 ,...,Ynm (y1 , . . . , ynm |θ0 , λm ) θ0 ∈ Θ, λm ∈ Λm (2.1) In particular, we assume that the observed data have been generated by one particular equilibrium (possibly in mixed strategies), where the equilibrium selection rule is not known to the econometrician. As we increase the number of players, the parameter space for the equilibrium selection rule, Λ = Λn grows very fast in dimension, so that we can’t hope to estimate λ0 consistently. Exchangeability. One substantive simplification arises if we assume that agents are exchangeable: recall that a random sequence Z1 , . . . , Zn with joint distribution f (z1 , . . . , zn ) is exchangeable if for any permutation π ∈ Π(1, 2, . . . , n), f (z1 , . . . , zn ) = f (zπ(1) , . . . , zπ(n) ). We will propose restrictions on the primitives of the game that yield distributions of type-action characters y1 , . . . , yn that are exchangeable across agents. This restriction reflects a particular kind of symmetry among different components of the observable data from the econometrician’s point of view. In particular, the identity of individual agents in a game may often be unknown or irrelevant, so that the parametric family f (y|θ, λ), θ ∈ Θ and λ ∈ Λm , should be invariant under permutations of, or within subsets of, the set of agents for each instance of the game.1 Many previous papers in the literature on estimation of games have assumed that the observed sample consists of i.i.d. draws of markets (see e.g. Ciliberto and Tamer (2009), Galichon and Henry (2011) and Beresteanu, Molchanov, and Molinari (2009)), which implies exchangeability of observed instances of the game. If the econometric analysis doesn’t distinguish between individuals players’ identities, then the likelihood is invariant with respect to permutations of agents within a given market. For example in the symmetric entry game considered by Bresnahan and Reiss (1991), for every payoff profile u the number of entrants is the same for all pure-strategy equilibria in Σ∗ (u). Since their analysis includes only market-level but no firm-specific characteristics, the empirical distribution of entry decisions is therefore uniquely defined if only pure equilibria are considered, and inference does not depend on the equilibrium selection mechanism. However, for the our purposes the most important implication of exchangeability is that for n large, the likelihood depends on the equilibrium selection rule λ0 only through the resulting P marginal distribution of type-action characters yi = (si , xi ), fY∗ (y|θ, λ) := n1 ni=1 fYi (yi |θ, λ) = f (y1 |θ, λ), and we are going to develop a limit theory for relevant features of that distribution as the number of players grows to infinity. For games with a small number of players, Menzel 1Note that if the game has a spatial structure where interactions depend on some notion of distance, the assumption of exchangeability would not be appropriate if the analysis is conditional on agents’ location. However unconditional procedures can allow for spatial interaction if agent locations are endogenous or can be modeled as part of their type. 8 KONRAD MENZEL (2011) proposes inference based on the invariant likelihood with respect to permutations of the set of players. However, the following classical result on sequences of finitely exchangeable random arrays suggests that a substantive simplification arises when the number of players is large: Theorem 2.1. (Theorem 3.1 in Kallenberg (2005)) Let Yn be an exchangeable sequence of length n with empirical distribution Fn (y|θ, λn). If Fn (y|θ, λn) converges weakly to a (possibly d random) limit F∞ (y|θ, λ∞), then Yn → Y , an infinite exchangeable sequence of i.i.d. draws from F∞ (y|θ, λ∞ ). This results states that in the limit, agents’ actions can be statistically dependent only through variation in the (stochastic) limiting distribution. Therefore exchangeability is a useful notion for describing outcomes of social interactions because it allows us to separate cross-sectional heterogeneity from aggregate uncertainty resulting from strategic interdependence or multiplicity of equilibria. This insight also has implications for identification arguments: Some approaches to identification rely on joint variation in observed or unobserved heterogeneity across players (see e.g. Tamer (2003) or Graham (2008)). However asymptotic conditional independence implies that with exchangeability, equilibrium outcomes may become uninformative about underlying preferences as markets grow ”thick,” and in the limit any identifying information on the parameter θ is contained in the marginal distribution of y1 .2 In particular arguments that are based on “identification at infinity” with respect to all players except for one will typically suffer from a curse of dimensionality with respect to the number of players and break down in the limit. While in general the joint distribution of ym contains more information about the parameter θ for any finite-player game than the empirical distribution of individual agents’ action-type profiles ymi , Theorem 2.1 suggests that under our exchangeability assumptions this difference is negligible when the number of agents in the market is large. Arguments based on exchangeability are also useful for the case in which only a random sample of players (in general without replacement) from a large game is available to the econometrician. Manski (1993) points out that the relationship between observations in a sample of that type is in general different from that between players in the population from which that sample was drawn. However since for large games the joint distribution can be approximated by i.i.d. draws from the empirical distribution of player-level type-action 2For example, Banerjee, Duflo, Ghatak, and Lafortune (2009) find that families placing marriage ads in an Indian newspaper report a preference for marrying a spouse within their own caste over a potential spouse from a different caste but with substantially higher income, whereas in the observed market outcome most families find a suitable spouse with comparable income within their own caste. GAMES WITH EXCHANGEABLE PLAYERS 9 characters, a sufficiently large subsample of players can be used to estimate the empirical distribution arising from the large-scale interaction model consistently even if it only represents a small share of the agents interacting at the population level. Empirical Distribution of Type-Action Characters. One general difficulty with structural inference in large games is that the parameter space Λn of equilibrium selection rules which are functions of all the nm players’ types - grows very fast in dimension as we let the number of players increase.3By Theorem 2.1, instead of considering the family of resulting joint distributions of the type-action profile in the mth market, f (ym1 , . . . , ymnm |θ, λm ) θ ∈ Θ, λm ∈ Λnm in the many-player limit we can restrict our attention to the empirical distribution of the exchangeable type-action characters which can be reparameterized as ∗ fm (ym1 |θ, γm) θ ∈ Θ, γm ∈ Γm The parameter space Γm consists of mappings from Θ to the set of equilibrium points of a properly defined limiting game. Most importantly, γ will not depend on the particular realization of t. This parameter set will in general differ across m = 1, . . . , M if we allow type distributions Hm (x, ε) to vary across markets, but does not depend on market size nm . We are going to illustrate that in many cases this constitutes a great simplification relative to the finite-player version of the game, especially when the set of equilibria in the limiting game is finite. For example in the symmetric entry game considered by Bresnahan and Reiss (1991), for every payoff profile u the number of entrants is the same for each of the pure-strategy equilibria in Σ∗ (u), so that under the assumption that entrants play a Nash equilibrium in pure strategies, the empirical distribution for si is uniquely determined. Furthermore in their analysis, there are market-level but no firm-specific characteristics and therefore inference does not depend on any assumptions regarding equilibrium selection. Example 2.2. Aggregate Games / Anonymous Interactions: In a finite action game, suppose that each player’s payoff only depends only on the fraction of players choosing each action s ∈ {s(1) , . . . , s(p) }. Also let λm ∈ Λm denote the distributions over equilibrium strat∗ egy profiles σm ∈ Σ∗m . We then show in Section 3 that under regularity conditions, the ∗ empirical distribution fm (Smi , Xmi |θ, λn ) of player’s actions Si and observable types Xi con∗ verges to a nonrandom limit fm (S1 , X1 |θ, γ) as nm → ∞, where γ ∈ Γ maps θ into the (finite) set of equilibrium points of the limiting average best-response correspondence. 3The generic maximal number of pure Nash equilibria in an n-player discrete game with p strategies is pn−1 (see McLennan (1997)). 10 KONRAD MENZEL We will analyze this problem in more detail in the next section. We will show that under regularity conditions, the limiting game has only a finite number of equilibrium distributions. For a given value of θ, the resulting parameter space Γm for the empirical distributions spans a finite-dimensional probability simplex whose dimension depends on the number of equilibria in the limiting game corresponding to the marginal type distribution Hm (x1 , ε1 ) and θ. Robust inference: In general, structural inference requires that we make statements about the parameter θ that do not depend on the nuisance parameter γ which indexes the possible limiting distributions for Ym1 = (Sm1 , Xm1 ). By conditional independence across agents in the limit, it is sufficient that we consider statistics that depend only on the empirical distribution of Ym1 , . . . , Ymnm . In the following we consider statistics at the market level that are of the form nm 1 X m(Ynm i ; θ) m̂nm (θ) := nm i=1 where m : Y × Θ → Rq is a vector-valued function of an agent’s type-action character. Considering the limiting experiment, we also denote the expectation with respect to the ∗ distribution fm (y|θ, γ) by Z ∗ m0 (θ, γ) = Eγ [m(Y1 ; θ)] := m(y; θ)fm (y|θ, γ)dy We consider empirical restrictions of the following form: we will assume that for every market m = 1, . . . , M, there exists a known set M0,m ⊂ Rq such that m0 (θ0 , γ) ∈ M0,m for all γ ∈ Γm In other words we can characterize the expectation of the statistic under any possible limiting distributions for the mth market. Example 2.3. (Aggregate Games) We will show that under the assumptions in section 4, the set of limiting distributions can be characterized as f∞ (y|θ0 , γ) where γ ∈ Γm the (finite) set of values G∗ solving a fixed point problem of the form Φ0 (G∗ ) = G∗ . In particular, we ∂ log f∞ (y|θ, Ĝ∗n ) and M0,m = {0}, will show that we can use moment conditions m(yi ; θ) := ∂θ where Ĝ∗n is a consistent estimator for G∗ . Example 2.4. (Moment Inequalities) If strategic choices are a result of economic optimization, then we can often derive moment equalities or inequalities on payoffs or profit functions, which under additional informational assumptions gives restrictions of the form E[m(Yin , θ0 )] ≥ 0 in any equilibrium of the game (see e.g. Bajari, Benkard, and Levin (2007) or Pakes, Porter, Ho, and Ishii (2006)), where for some s′ ∈ S m(Yin , θ0 ) := E[u(Sin , Xi ; θ) − u(s′ , Xi , θ)|Xi ] ⊗ GAMES WITH EXCHANGEABLE PLAYERS 11 ψ(Xi ), and ψ(x) is a q-dimensional vector of nonnegative functions of x. Then the empirical restriction for the limiting game is of the form m0 (θ0 , γ) ∈ M0,m := Rq+ the positive orthant in Rq . Clearly, these restrictions only have identifying power if the “population” moments m0 (θ, γ) vary with θ. Furthermore, the restrictions are uninformative against an alternative θ1 if the data generating process puts zero probability on the event {m(θ1 , γ) ∈ / M0,m } for a given distribution for γ. In particular, identification will be more difficult if the set M0,m is very rich due to a large number of equilibria. However we will show that in many large discrete games, the parameter set Γm can often taken to be finite. 3. Aggregate Games A class of normal form games with large numbers of players for which limiting results can be obtained are aggregate games in which players interact strategically only through a finite-dimensional aggregate state of the economy. In this section we consider a static, full information discrete game with players i = 1, . . . , n where players’ types ti = (xi , εi ) consist of characteristics xi ∈ X that are observed by the econometrician, and unobserved preference shocks εi . Player types ti are i.i.d. draws from the population distribution H(t) and are common knowledge among all agents. Each player selects an action from a finite set S = (s(1) , . . . , s(p) ), and we denote the empirical distribution of actions in the market with Ĝn (σ) = (Ĝn (s(1) ; σ), . . . , Ĝn (s(p) , σ))′ , where n 1X σj (s), s ∈ S Ĝn (s; σ) := n j=1 We also let ∆S denote the space of distributions over S represented by the (p−1) dimensional probability simplex endowed with the Euclidean distance d(·, ·). Interactions among players are anonymous in that we assume that payoffs are of the form ui (si , σ−i , ti ) = u ti , si , Ĝn ((si , σ−i )); θ where, with some abuse of notation, we let (si , σ−i ) denote the mixed strategy profile (σ1 , . . . , σi−1 , δsi , σi+1 , . . . , σn ). In particular, we assume that each agent’s payoffs only depend on her own type, but not that of any other players. Note that with minor changes to our notation, we can easily extend this framework to allow for preferences to depend on the joint distribution of actions and indicators for a finite partition of the type space. For example, we may want to allow for the respective number of male and female smokers to affect a male or female player’s payoff to a different degree. 12 KONRAD MENZEL For finite type spaces, this model has been considered by Kalai (2004) and Al-Najjar (2006) and includes the economies analyzed by Acemoglu and Jensen (2009) and Martimort and Stole (2011). This setup differs from the binary choice model with social interactions in Brock and Durlauf (2001) in that their analysis assumes that players’ types are private information, however by the laws of large numbers derived in Kalai (2004) and this paper this model becomes equivalent to a setting with common knowledge of types as the number of players grows large.4 Brock and Durlauf (2007) establish point identification for the privateinformation binary choice game under conditional quantile restrictions on the distribution of εi given xi . In the following, we will denote n 1 G− σ n−1 n−1 for σ, G ∈ ∆S. Given payoff functions, we also define the set of best responses to an aggregate profile Gs as G̃n,−1(G, σ) := ψ0 (G, t) := conv {δs |s ∈ S : ∀s′ ∈ S : u(s, G, t) ≥ u(s′, G, t) } where δs is the sth unit vector in Rp−1. Since in the following we have to account for multiplicity of best responses, we have to formulate the problem in terms of random sets and correspondences, where for ease of exposition, we relegate the definition of the relevant technical concepts to footnotes. In order to solve for equilibrium distributions of actions Ĝ∗n (s), define the correspondence of implicit best responses supporting an aggregate state G as5 n o ψn (G, t) := σ ∈ ∆S σ(s) > 0 only if s ∈ ψ0 (G̃n,−1(G, σ), t), and G̃n,−1(G, σ) ∈ ∆S Notice that the first condition on σ requires that agents randomize only between best ren 1 sponses to the empirical distribution n−1 G− n−1 σ. If preferences are locally strictly monotone in G, then ψn (G, t) will typically contain a finite (and usually odd) number of points for any finite n. We denote the projection of ψ(G, t) onto the sth dimension of the strategy space by ψn (s; G, t) := {σ(s) : σ ∈ ψn (G, t)} 4Horst and Scheinkman (2006) consider both global and local interaction among a fixed (but possibly infinite) set of agents, whereas Horst and Scheinkman (2009) extend that analysis to limits of finite-player economies. Their results only consider invariant equilibria and assume a Lipschitz condition on individual best response functions, which implies uniqueness of the equilibrium and excludes discrete action games with full information. 5Recall that a correspondence from X to Y, denoted by Φ : X ⇒ Y is a mapping from elements x ∈ X to subsets of Y GAMES WITH EXCHANGEABLE PLAYERS 13 Also define the set of types for which s is a best response to G by A(s, G) := {t ∈ T : u(s, G, t) ≥ u(s′ , G, t) for all s′ ∈ S} Assumption 3.1. (i) The payoff functions are of the form u(s, G, t), where u is uniformly continuous in G and t ∈ T for all s ∈ S, (ii) the collection A := {A(s, G) : s ∈ S, G ∈ ∆S} is a VC class of sets. There are a number of empirically relevant examples under which the second part of Assumption 3.1 is satisfied: Clearly, A is a VC class if the number of types t is finite as in Kalai (2004). Also if payoffs have a linear index structure in ti for all G, i.e. u(s, G, t) = v(t′ β(G), G) then the sets A(s, G) are intersections of the type space T with linear half-spaces in Rdim(t) , and therefore a VC class with index less than or equal to dim(t) + 1. This example is a generalization of the framework considered by Brock and Durlauf (2001). For the following, notice that the sum of two correspondences Φ1 : X ⇒ Y and Φ2 : X ⇒ Y is defined as the Minkowski sum of their images, Φ1 (x) ⊕ Φ2 (x) := {y1 + y2 |y1 ∈ Φ1 (x) and y2 ∈ Φ2 (x) } If all summands are singletons, Minkowski addition reduces to the standard notion of addition. We now define the aggregate response mapping n 1M Ψ̂n (G) := ψn (G, ti ) n i=1 and the population mean of the best-response correspondence Ψ0 (G) := E [ψ0 (G; T )] where E[·] is taken to denote the (Aumann) selection expectation of the random set ψ̃0 (G, T ) ⊂ ∆S and T ∼ H0 .6 Figure 3 illustrates the construction of Ψ̂n (G) from the individual best responses ψn (ti ; G) for a given realization of types in the binary action case. The following Lemma characterizes the set of Nash equilibria of the n-player game as the solutions to the inclusion G∗ ∈ Ψ̂n (G∗ ): 6The Aumann selection expectation of a closed random set X : Ω → 2X is defined as E[X] := closure{E[ξ] : ξ ∈ Sel(X)} where Sel(X) denotes the set of measurable selections ξ(ω) ∈ X(ω) such that Ekξk1 < ∞. See Molchanov (2005) for a full exposition. KONRAD MENZEL 1 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 b5 Ψ ψ̃(G, t, 5) 14 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 0 0.2 0.4 0.6 G 0.8 1 0 0.2 0.4 0.6 0.8 1 G Figure 1. The correspondence ψ5 (ti ; G) for five draws of ti in a five-player binary action game (left), and the corresponding aggregate response mapping Ψ̂5 (G) (right). Lemma 3.1. Suppose Assumption 3.1 (i) holds. Then there exists a Nash equilibrium with empirical distribution G∗ if and only if G∗ ∈ Ψ̂n (G∗ ), i.e. G∗ is a fixed point of the correspondence Ψ̂n . In particular, there exists a solution to the inclusion G∗ ∈ Ψ̂n (G∗ ) for each n = 1, 2, . . . with probability 1, and any distribution G∗ that is supported by a Nash equilibrium satisfies G∗ ∈ convΨ̂n (G∗ ), the convex hull of Ψ̂n (G∗ ). Proof: Suppose that G∗ ∈ / Ψ̂n (G∗ ). Then by definition of Ψ̂n , there exists at least one player i such that σi ∈ / ψn (G∗ , ti ), so that σi is not a best response to any profile σ−i P satisfying n1 nj=1 σj = G∗ . Conversely, if G∗ ∈ Ψ̂n (G∗ ), we can find σ1 , . . . , σn such that P σi ∈ ψn (G∗ , ti ) for all i = 1, . . . , n and n1 ni=1 σi = G∗ , so that G∗ is indeed generated by a Nash equilibrium. Existence of an equilibrium distribution G∗ for finite n follows from an existence theorem for Nash equilibria in finite games, see e.g. Proposition 8.D.2 in MasColell, Whinston, and Green (1995) Note that for the last part of Lemma 3.1 the converse doesn’t hold: there may be solutions to G ∈ convΨ̂n (G) that do not correspond to Nash equilibria. Below we show that the set n o ∗ ∗ ∗ ∗ of solutions to the inclusion Ĝn := G : G ∈ convΨ̂n (G ) approaches the set of solutions to G ∈ Ψ̂0 (G), which implies that the set of Nash equilibria in the finite game approximates a subset of G0∗ = {G∗ : G∗ ∈ Ψ0 (G∗ )}, but in general some solutions in G0∗ need not be approximable by sequences of (strict) Nash equilibria.7 7One approach in the literature has been to broaden the equilibrium concept to equilibria in ε-best responses, as in Kalai (2004). In contrast, our Assumption 3.2 below imposes that all equilibria in G0∗ are regular points of the limiting mapping, all of which are then shown to be approximable by (strict) Nash equilibria in the proof of Theorem 3.1. GAMES WITH EXCHANGEABLE PLAYERS 15 We will now give a condition that ensures that solutions to the limiting equilibrium mapping Ψ0 are locally unique, implying that the number of distinct equilibrium distributions is finite. We also want to allow for the limiting equilibrium mapping to be set-valued, which will in general be the case if the distribution of types is not continuous. For this we have to define a graphical derivative of a correspondence:8 The contingent derivative of the correspondence Φ at (x′0 , y0 )′ ∈ gph Φ is a set-valued mapping DΦ(x0 , y0 ) : X ⇒ Y such that for any u ∈ X Φ(x0 + hu′ ) − y v ∈ DF (x, y)(u) ⇔ lim inf d v, h↓0,u′ →u h where d(a, B) is taken to be the distance of a point a to a set B.9 Note that if the correspondence Φ is single-valued and differentiable, the contingent derivative is also single-valued and equal to the derivative of the function Φ(x). The contingent derivative of Φ is surjective at x0 if the range of DΦ(x0 , y0 ) is equal to Y. In the following, let the mapping Φ0 (G) := Ψ0 (G) − G. Assumption 3.2. (Regular Economy): For every θ ∈ Θ and population distribution H0 ∈ P, one of the following holds: (i) Ψ0 (G) is single-valued for all G ∈ ∆S, and at every distribution G∗ solving G∗ ∈ Ψ0 (G∗ ), ∇G Φ0 (G∗ ) is defined, and ∇G Φ0 (G∗ ) has rank equal to S − 1, or (ii) The contingent derivative of Φ0 (G) is lower semi-continuous and surjective for all G, and satisfies 0 ∈ gph DΦ0 (G∗ , G∗ )(G) if and only if G = G∗ at every distribution G∗ solving G∗ ∈ Ψ0 (G∗ ). Note that (i) is a special case of (ii) for which the conditions for local separation of equilibria are easier to interpret. In analogy with classical genericity analysis in general equilibrium theory, it is possible to derive primitive conditions for this assumption in terms of the population distribution of types based on the restrictions on payoff functions in Assumption 3.1. Furthermore, with payoffs depending on a parameter θ, this assumption imposes restrictions on the parameter space Θ. Although violations of these generalized rank conditions are typically “non-generic” in the sense that they correspond to subsets of Θ of (Lebesgue) measure zero, they will generally lead to unstable behavior of the economy for parameter values in the neighborhood of those singular points. However, conditions of this type are standard in econometric analysis of equilibrium models.10 8Recall that the graph of a correspondence Φ : X ⇒ Y is the set gph Φ := {(x, y) ∈ X × Y : y ∈ Φ(x)}. Definition 5.1.1 and Proposition 5.1.4 in Aubin and Frankowska (1990) 10See e.g. Assumption 2.1 in Hausman (1983) for the linear simultaneous equations model. 9See 16 KONRAD MENZEL Figure 2. Schematic illustration of Assumption 3.2. The bottom three panels are examples of a failure of the transversality condition (left) and/or the surjectivity condition (all three panels). In order to understand regularity for the case of a limiting correspondence Ψ0 , consider the case S = {0, 1}, so that G ∈ [0, 1]. If there is a fixed point G∗ such that Ψ0 (G∗ ) is a convex set with nonempty interior, then condition (ii) of Assumption 3.2 is violated if and only if there exists sequences Gn 6= G∗ with limn Gn = G∗ and Ψn ∈ Ψ0 (Gn ) such Ψn −G∗ that limn |G ∗ = 0. Cases in which the regularity condition fails include if for all G in a n −G | one-sided neighborhood of G∗ , Ψ0 (G) is set valued and contains G∗ , or if in that one-sided neighborhood Ψ0 (G) is single valued with limGn →G∗ Ψ0 (Gn ) = Ψ0 (G∗ ), and the one-sided derivative of Ψ0 (G∗ ) = Ip−1 , the (p − 1) dimensional identity matrix. See also figure 3 for graphical examples. If the distribution of types Ti is continuous and payoff functions are strictly monotone in each argument almost everywhere, then at every value of G players are indifferent between multiple actions with probability zero. Hence under Assumption 3.1, the limiting correspondence Ψ0 is typically set-valued only if the type distributions has atoms. Lemma 3.2. Under Assumption 3.2, the number of solutions of the inclusion G ∈ Ψ0 (G) is finite. Proof: Since the game is finite with |S| = p, the space of distributions ∆S is the (p − 1) dimensional probability simplex, and therefore compact. Under Assumption 3.2, part (i), GAMES WITH EXCHANGEABLE PLAYERS 17 each equilibrium G∗ is unique within an open neighborhood around G∗ by the inverse function theorem. If part (ii) of Assumption 3.2 holds, then each equilibrium G∗ is locally unique by Proposition 5.4.8 in Aubin and Frankowska (1990). We can use these open neighborhoods of the equilibrium points to cover ∆S, and by compactness, there exists a finite subcover N1 , . . . , NJ , say. By construction, each set Nj , j = 1, . . . , J contains at most one equilibrium point, so that the number of equilibria is at most J, and therefore finite This proof is very similar to the classical arguments for finiteness of the number of equilibria in regular market economies (e.g. Proposition 17.D.1 in Mas-Colell, Whinston, and Green (1995)) or finite-player games (Theorem 1 in Harsanyi (1973)). We can now state our main result on stochastic convergence of the equilibrium values of the aggregate Ĝn . Proposition 3.1. Suppose Assumptions 3.1 and 3.2 hold, and that Ĝn is a sequence of empirical distributions solving Ĝn ∈ Ψ̂n (Ĝn ). Then for G0∗ := {G∗ ∈ ∆S : G∗ ∈ Ψ0 (G∗ )} a.s. and as n → ∞, we have that (a) d(Ĝn , G0∗ ) → 0, and (b) with probability approaching 1, for every G∗0 ∈ G0∗ and every neighborhood B(G∗0 ) of G∗0 we can find Gn ∈ B(G∗0 ) such that Gn ∈ Ψ̂n (Gn ). See the appendix for a proof for this proposition. Given the conclusion of 3.2, part (a) of the conclusion implies that although the number of distinct equilibria may grow very fast as the number of players increase, the implied distributions of actions become concentrated around a finite set G0∗ . Taken together, parts (a) and (b) show that the set G0∗ of equilibrium values for the population correspondence Φ0 is equal to the set of limiting points of the set of equilibria in the n-player games. Note that the surjectivity condition in Assumption 3.2 is crucial in attaining the stronger conclusion (b). Note that we can interpret the limiting game as the private information version of the aggregate game, where players only know the distribution of opponents’ types, but not their realizations. It is straightforward to check that the solutions of the fixed-point problem G ∈ Φ0 (G) correspond to the Bayes perfect Nash equilibria (BPNE). Since by definition, individual actions arising from a BPNE only depend on a player’s own type, from the individual agent’s perspective, the informational requirements for optimal play are much lower in the limiting game than in the complete information version with finitely many players. Grieco (2012) investigated identification of discrete games under different informational assumptions. Our results, together with those of Kalai (2004) suggest that those differences appear to be less important in aggregate games with a large number of agents. Some of the asymptotic arguments in the next section require that the joint distribution of best responses varies continuously in the aggregate G or that the aggregate best response correspondence Ψ0 (G) is single-valued. More specifically, we introduce the following set of 18 KONRAD MENZEL additional assumptions on preferences and types for the results requiring a minimal degree of smoothness: Assumption 3.3. Types can be partitioned as ti = (t′1i , t′2i )′ such that (i) the conditional distribution of t1i given t2i is tight and continuous with a uniformly bounded continuous p.d.f., and (ii) either of the following holds: (a) (“Unordered Choice”) the subvector t1i ∈ Rp−1 , and for the payoff vector u(t, G) := u(s(1) , t, G), . . . , u(s(p) , t, G) , we have that ∇t1 u(t, G) has rank p − 1 for almost all t∈T. (b) (“Ordered Choice”) t1i ∈ R, and u(s(l) , t, G) − u(s(k) , t, G) is strictly monotone in t1 for k 6= l and almost all t ∈ T . (iii) xi is a subvector of t2i . Note that most standard parametric random utility models for choice among discrete alternatives meet either of these assumptions. Typically, specifications include additive alternative-specific taste shifters that are continuously distributed and have unbounded support conditional on observable characteristics. While part (b) of the assumption does not require a particular ordering of the alternative choices, classical specifications for ordered choice models include a scalar source of heterogeneity that satisfies the monotonicity condition on utility differences. The crucial difference between the two sets of assumptions are that part (a) does not impose any global conditions on the general shape of payoff functions but requires rich heterogeneity in preferences. On the other hand, the increasing differences requirement in part (b) allows to restrict the amount of heterogeneity needed to make the problem sufficiently smooth. Part (iii) requires that the unobservable taste shifters are sufficiently rich such that the smoothness properties stated below also apply to the conditional model given observable player characteristics, and will only be used for the derivation of asymptotic results in section 4. From a technical standpoint, Assumption 3.3 also greatly simplifies the analysis as the following lemma shows. Lemma 3.3. Suppose Assumption 3.3 (i) and (ii) hold. Then for almost all types ti , ψ0 (ti ; G) is single-valued, and the aggregate best-response correspondence Ψ0 (G) is single-valued at all values of G ∈ ∆S. The proof for this lemma is in the appendix. This result allows us to work with the simpler version of the transversality condition in Assumption 3.2 (i) and helps avoid tedious case distinctions in the characterization of sequences that approach mixed strategy equilibria in the limiting game when we turn to the construction of a coupling between games with different numbers of players below. GAMES WITH EXCHANGEABLE PLAYERS 19 4. Asymptotic Analysis This section develops conditional laws of large numbers and central limit theorems for statistic of the form nm 1 X m(Ynm i ; θ) m̂nm (θ) := nm i=1 given an appropriately specified tail sigma field. The following analysis only considers one single market, so that we can drop the m subscript without loss of generality. The results can then be applied for each m = 1, . . . , M separately, so that information from different markets can be combined for estimation or inference, where we can also allow for type distributions and parameters to be market-specific. In order to ensure that the finitely exchangeable sequences {Yni : 1 ≤ i ≤ n} are defined on the same probability space for all n ≥ 1, we introduce auxiliary types νi that are not payoff-relevant but determine choice among equilibria alongside with the payoff-relevant types ti = (x′i , ε′i )′ . In a given market, types are assumed to be exchangeable random variables with a fixed marginal distribution for payoff-relevant type: Condition 4.1. (Types and Preferences) Preferences are given by utility functions u : S × T → R, ui (s) := u(s, Xi , εi ; θ) that depend on an unknown parameter θ. The n players’ payoff-relevant types Xi , εi are i.i.d. draws from a distribution H(x, ε), and there is an augmented type Wi = (Xi′ , ε′i , νi′ ), such that W1 , W2 , . . . form an infinite exchangeable sequence. We will show below that we can typically allow νi to be rich enough to generate any mixture over equilibrium distributions at any sample size n. In particular, the limiting empirical distribution for the exchangeable sequence W1 , W2 , . . . will in general be non-deterministic. We now introduce a coupling of the sequence of type-action characters determined in equilibrium, Yn1, . . . , Ynn , to an infinitely exchangeable sequence Y1 , Y2 , . . . . We also define a filtration {Fn }n≥1 generated by symmetric events in that random sequence: Given the random sequence Y1 , Y2 , . . . we say that a random variable Zn is n-symmetric if it can be written Zn = rn (Y1 , . . . , Yn , . . . ) for some function rn (·) satisfying r(y1, . . . , yn , . . . ) = r(yπ(1) , . . . , yπ(n) , . . . ) for any permutation π of {1, . . . , n}. We then define Fn as the sigma algebra defined by all n-symmetric random variables, we clearly have F1 ⊇ · · · ⊇ Fn ⊇ Fn+1 ⊇ · · · ⊇ F∞ . The tail field F∞ is the sigma algebra generated by the marginal distributions of Y1 . The following conditions on the equilibrium selection rule are needed to embed the observable n-player game into a sequence defined on a common probability space that allows for meaningful conditional convergence results. 20 KONRAD MENZEL Condition 4.2. (Equilibrium Selection)(i) The sequence of equilibrium selection rules {λn }n≥1 , λn ∈ Λn is of the form λni = λn (wi , w1 , . . . , wn ) and consistent with the solution concept based on payoff functions ui (s) := u(s, ti ; θ). (ii) There exists a set Γ and a family of distributions {f (y1|θ, γ) : γ ∈ Γ} such that Y1 |γ̃ ∼ f (y1 |θ, γ̃) for some F∞ -measurable random variable γ̃ such that γ̃ ∈ Γ almost surely, and (iii) there exists a deterministic null sequence cn → 0 such that |E [ m(Yn1 ; θ) − m(Y1 ; θ) |Fn ]| < cn a.s. for all n. Part (i) of Condition 4.2 can be understood as a row-wise stability condition for the triangular array generated by equilibrium play, and its main purpose is to ensure that {Yin : 1 ≤ i ≤ n}n≥1 is adapted to a common filtration. It should also be noted that by restricting λni (·) to depend only on players’ types, but not their identities, the solution concept of the game has to treat players symmetrically, but for a given realization of ν1 , . . . , νn the realized equilibrium need not be symmetric. The functional form of λn (·) is otherwise arbitrary, so that apart from joint exchangeability of type-action characters, this condition does not appear to be restrictive regarding the row-wise distributions of the triangular array. In particular, we demonstrate that for aggregate games, we can generate any mixtures over not necessarily symmetric equilibria even after conditioning on the payoff-relevant components of types xi and εi . Part (ii) requires that the rows of the triangular array are coupled in a way such that the empirical distribution of type-action profiles converges almost surely to a well-defined limit. In many applications, existence of this limit is not essential but one may allow the sequence to oscillate among shrinking neighborhoods of a well-defined set of cumulation points. Hence this part may in many cases be weakened but for clarity of the exposition we maintain this slightly stronger condition. Note in particular that the (random) marginal distribution for Y1 is F∞ -measurable, so that measurability of γ̃ does not impose any additional restrictions. Below, we will illustrate how to establish Condition 4.2 from the primitive assumptions given in section 3. We maintain the following regularity conditions on the function m(·; θ) defining the moment function m̂n (θ) in conjunction with the type distribution H(x, ε) given in Condition 4.1: Condition 4.3. (Uniform Integrability) (i) The family {m(y; θ) : θ ∈ Θ} is a VC class of functions, and (ii) E|m((s, X1 ); θ)| and |E [m((s, X); θ) − m((s′ , X); θ)]| are bounded by a constant for all s, s′ ∈ S and uniformly in θ ∈ Θ. GAMES WITH EXCHANGEABLE PLAYERS Denote the limit m0 (θ, γ) := E[m(Y1 ; θ)|γ̃ = γ] = Z 21 m(y; θ)f (y|θ, γ)dy Since the distribution function F is measurable with respect to F∞ , strong convergence of m̂n (θ) with respect to the filtration {Fn }n≥1 implies almost sure convergence to m0 (θ, γ). This paper does not explicitly analyze point or partial identification for structural parameters of particular game-theoretic models, but we only assume that the model has empirical implications of the form that any almost sure limits of the quantities m̂n (θ) belong to a known, nontrivial subset of Rq . Condition 4.4. (Empirical Restrictions) Let γ̃ be defined as in Condition 4.2. Then there exists a set M0 ⊂ Rq such that m0 (θ, γ̃) ∈ M0 almost surely. This condition primarily serves to characterize the relationship between moments of the data and the structural parameter under the different tail events associated with different values of γ ∈ Γ. While it is common practise to discuss identification for finite-sample distributions and treat the asymptotic analysis for an estimation or inference procedure as a separate problem, identification properties of game-theoretic models may well change with the number of players. Since in this paper we describe the behavior of estimators or tests around their limits, it is more useful to think about identifying power of econometric restrictions in the many player limit than any finite size. Condition 4.4 applies to a variety of estimation problems - e.g. the function m(Y ; θ) could characterize a vector of moment inequalities derived individual optimization behavior, and this restriction could be written in the form m(F ; θ) ≥ 0 a.s., so that M0 = Rq . We will also give examples for which M0 consists of a finite set of points. We are now going to state our first asymptotic result which is a fairly straightforward adaptation of Birkhoff’s law of large numbers. Since this law of large numbers is uniform with respect to θ, it can be used to derive consistency results for θ or parameter sets based on Condition 4.4. Theorem 4.1. (Conditional LLN) Suppose Conditions 4.1-4.3 hold. Then n 1X a.s. m(Yin ; θ) → E[m(Y1 ; θ)|F∞ ] n i=1 as n → ∞. Moreover, convergence is uniform in θ. Proof: Note that the set of equilibria in the n-player game only depends on payoff profiles which are determined by the values of (X1 , ε1), . . . , (Xn , εn ), so that by Conditions 4.1 and 4.2 (i), observable type-action characters Yni := (Sni , Xi′ )′ are exchangeable for every n. By Condition 4.2, the column-wise limit of Yni as n → ∞ is well-defined with probability 22 KONRAD MENZEL one for every i = 1, 2, . . . , so that n-exchangeability of Yn1 , . . . , Ynn implies that the limiting sequence Y1 , Y2 , . . . is infinitely exchangeable. Now a standard argument (see e.g. Kingman (1978)) yields that for any n-symmetric event An ∈ Fn and any j = 1, . . . , n, # " n 1X m(Yi ; θ)1lAn E[m(Y1 ; θ)1lAn ] = E[m(Yj ; θ)1lAn ] = E n i=1 P Since n1 ni=1 m(Yi ; θ)1lAn is Fn -measurable, we get # " n n X 1 1X E[m(Y1 ; θ)|Fn ] = E m(Yi ; θ) Fn = m(Yi ; θ) n i=1 n i=1 Hence by Condition 4.3, the sequence n Zn∗ n 1X 1X m(Yni ; θ) + [m(Yi ; θ) − m(Yni ; θ)] := n i=1 n i=1 is a reverse martingale adapted to the filtration {Fn }∞ n=1 , so that by the reverse martingale a.s. theorem (e.g. Theorem 10.6.1 in Dudley (2002)), Zn∗ → E[m(Y1 ; θ)|F∞]. Since by Condition 4.2 (iii) n ∗ 1X m(Yni ; θ) = |E [m(Yi ; θ) − m(Yni ; θ)|Fn ]| < cn Zn − n i=1 almost surely with cn → 0, we also have n 1X a.s. m(Yni ; θ) → E[m(Y1 ; θ)|F∞ ] n i=1 Since almost sure convergence is joint for any random vector satisfying the conditions of this Theorem, uniformity in θ follows from Condition 4.3 following the same reasoning as in the proof of the (almost sure) Glivenko-Cantelli theorem for the i.i.d. case, see van der Vaart (1998), Theorems 19.1 and 19.4 The result can also be easily adapted to settings in which the econometrician only observes a random subsample of Nn < n players in the game. More specifically, let qi be an indicator variable that equals one if agent i is included in the sample, and zero otherwise, P and qi is independent of Y1 , . . . , Yn . Note that the sample average N1n ni=1 qi m(Yin ; θ) is i h P Fn -measurable with E Nnn qi m(Yin ; θ) Fn = n1 ni=1 m(Yni ; θ). Then, if Nnn is bounded, the conditions of Theorem 4.1 hold. so that we obtain the following conclusion: GAMES WITH EXCHANGEABLE PLAYERS 23 Corollary 4.1. Suppose the assumptions of Theorem 4.1 hold, and we observe a random sample of Nn ≤ n players’ type-action profiles, where Nnn is bounded by a constant. Then uniformly in θ as n → ∞. Nn 1 X a.s. m(Yni ; θ) → E[m(Y1 ; θ)|F∞] Nn i=1 For some applications, it will also be useful to allow for moment functions of the form m (Yi ; θ, η0 ) that also depend on an auxiliary parameter η0 ∈ H ⊂ Rl , where η0 is replaced by an estimator η̂n based on the sampling distribution of player types. For example, a restriction on m∗ (y; θ, η) may be derived from the scores corresponding to players’ best response to the limiting equilibrium value of η0 := G∗0 . In that case, results should also be sufficiently general to allow for the parameter η0 to be random even in the limit. We find that such an extension of the LLN only requires strong consistency of η̂n mild additional uniformity conditions with respect to η: ∗ Corollary 4.2. Suppose that η̂n → η0 a.s., where η0 is an F∞ -measurable random variable, and m∗ ((s, Xi ); θ, η) is continuous in η a.e. with respect to the distributions of η0 and Xi . Also let the assumptions of Theorem 4.1 hold for m(Yi ; θ) ≡ m∗ (Yi ; θ, η0 ), and Assumption 4.3 hold with constants and a VC index that is fixed with respect to η ∈ H. Then n 1X ∗ a.s. m (Yni ; θ, η̂n ) → E[m(Y1 ; θ, η0 )|F∞ ] n i=1 uniformly in θ as n → ∞. 4.1. Construction of Coupling. We now discuss the construction of a coupling of the type-action characters Yni of the finite-player aggregate game described in section 3 to an infinitely exchangeable sequence Y1 , Y2, . . . that satisfies the requirements for the conditional law of large numbers and central limit theorems in this section. The construction of the coupling requires that we specify the distribution of the auxiliary states ν1 , ν2 , . . . that are not payoff-relevant but determine which equilibria are chosen, and a function λn (·) has to satisfy the symmetry restrictions given in Condition 4.2. Finally we want the coupling to be flexible enough to allow to embed any possible cross-sectional distribution over Nash equilibria in the n player game into a converging sequence for some permissible choice of the functions λn . Proposition 4.1. (Coupling for Aggregate Games) Suppose Assumptions 3.1-3.3 holds. For the N-player game assume that the observed action profile is generated by a mixture over the Nash equilibria for the type-profile T1 , . . . , TN with probability one, and let fN∗ (s, x|θ) be the resulting unconditional distribution over type-action characters. 24 KONRAD MENZEL Then there exists an i.i.d. sequence ν1 , ν2 , . . . of random variables in Rp , and a sequence of functions λn : T n → S n such that fN (s, x|θ, λN ) = fN∗ (s, x|θ) for all s ∈ S n , and such that Condition 4.2 holds. The proof for this result is in the appendix. Since Assumptions 3.1-3.3 also imply Condition 4.1, we can apply the conditional LLN in Theorem 4.1 and obtain the following corollary: Corollary 4.3. Suppose the assumptions of Proposition 4.1 and Condition 4.3 hold. Then the average n 1X m(Yni ; θ) → E [m(Y1 ; θ)|F∞ ] a.s. n i=1 4.2. Asymptotic Distribution. For inference and confidence statements, we next develop a distribution theory that is conditional on the tail events in F∞ . To this end we are going to consider mixing convergence in distribution: We say that for a real-valued random sequence d Zn and a random variable Z convergence in distribution, Zn → Z, is mixing relative to the sigma-field F∞ if for all events A ∈ F∞ , lim P ({Zn ≤ z}|A) = P (Z ≤ z) n at all continuity points z of the c.d.f. of Z, see Hall and Heyde (1980). In general conditions on the coupling to obtain a CLT with mixing will be more restrictive than what was needed for a conditional LLN. We will focus on the case of aggregate games described in section 3, and propose a fairly intuitive approach to incorporating equilibrium conditions as additional estimating equations into a derivation of the asymptotic distribution of m̂n (θ). Recall that any Nash equilibrium G∗n in the aggregate game satisfies the fixed-point inclusion n 1M ψ̃n (ti , G∗n ) − G∗n 0∈ n i=1 We can then stack the moment and fixed point conditions, and consider the joint distribution of " # " # n ′ ′ M m((Xi , ψn (ti , G)) ; θ) m̂n (G, θ) 1 := n i=1 ψn (ti , G) Ψ̂n (G) Note that m(Yi ; θ) = m ((Xi′ , ψ0 (ti , G0 ))′ ; θ) =: m0 (ti , G0 ; θ) Similarly, we can write m(Yni ; θ) = m (Xi′ , ψn (ti , Ĝn ))′ ; θ =: mn (ti , Ĝn ; θ) √ We derive the asymptotic distribution of n(m̂n (θ) − m0 (θ)) conditional on F∞ by augmenting the estimating equations by a state condition which defines equilibrium in the limiting game, but only approximates the fixed-point condition for the finite player game: GAMES WITH EXCHANGEABLE PLAYERS 25 Given θ, we let the random variable G∗0 := E[Y1 |F∞ ], the limiting equilibrium value of G conditional on F∞ . Also, m∗0 := E[m0 (ti , G∗0 ; θ)|F∞ ], and µ0 := (m∗0 , G∗0 )′ . Similarly, define P m∗n := n1 ni=1 mn (ti , Ĝn ; θ) and µ̂n := (m̂n , Ĝn ). We can then stack the moment conditions defining µ̂n by defining the multi-valued function " # mn (ti , G; θ) − m rn (ti ; θ, µ) := ψn (ti ; G) − G By inspection, holding θ ∈ Θ constant, µ̂n is a solution of the generalized equation n 0 ∈ r̂n (θ, µ̂n ) := 1M rn (ti ; µ̂n ) n i=1 Now let ψ0∗ (ti ; G) ∈ ψ0 (ti ; G) be an arbitrary selection of the limiting best response correspondence, and let " # m0 (ti , G; θ) − m r(ti ; θ, µ) := ψ0∗ (ti ; G) − G It follows that for a given value of θ and under the conditions of the theorem, µ0 is a solution of 0 = r0 (θ, µ0 ) := E [r(ti ; θ, µ)|F∞] Note that the general structure may be similar in other settings, e.g. dynamic games or matching markets, although in these classes of games the equilibrium condition will typically be infinite-dimensional. For the derivation of the asymptotic distribution of m̂n (θ) we define the conditional covariance matrices Ωmm′ := E[(m(Yi ; θ) − m∗0 )(m(Yi ; θ0 ) − m∗0 )′ |F∞ ] Ωψψ′ := diag(G∗0 ) − G∗0 (G∗0 )′ , Ωmψ′ := E[(m(Yi ; θ0 ) − m∗0 )(ψ0 (ti ; G∗0 ) − G∗0 )′ |F∞ ] We also let Ω := " Ωmm′ Ω′mψ′ Ωmψ′ Ωψψ′ # Note that since the empirical distribution of {X1 , . . . , Xn } is Fn -measurable, conditioning on F∞ already implies conditioning on the unordered sample {X1 , . . . , Xn } if the associated conditional variance matrices converge in probability along the filtration {Fn }n≥1. Also let ṀG := ∇G E [m((Xi′ , ψ0 (ti ; G∗0 ))′ ; θ0 )|F∞ ] and P ψ0 (s(1) , ti ; G∗0 ) = {1} .. Ψ̇G := ∇G . (p) ∗ P ψ0 (s , ti ; G0 ) = {1} 26 KONRAD MENZEL We now impose standard regularity conditions on the problem that ensure a joint normal asymptotic distribution for the aggregate state and the moment conditions m̂n (θ): Assumption 4.1. (i) The equilibrium points G∗0 ∈ G0∗ are interior points of ∆S, (ii) the class Ms := {m((Xi′ , s)′ ; θ) : θ ∈ Θ} is Donsker with respect to the distribution of Xi for each s ∈ S with a square-integrable envelope function, and (iii) the eigenvalues of Ω are bounded away from zero and infinity. The first part of this assumption ensures that the aggregate state Ĝn is a regular parameter in the sense of Bickel, Klaassen, Ritov, and Wellner (1993) conditional on any tail event in F∞ . It is possible to show that this condition is satisfied e.g. if Assumption 3.3 (ii) is strengthened to hold with all eigenvalues of ∇t1 u(t, G)∇t′1 u(t, G) bounded away from zero, and if the support of t1 is equal to Rp−1 . Under those additional conditions, for every action s ∈ S and G ∈ ∆S there is a positive mass of types such that s is a best response to G, so that Φ0 (G) ∈ int ∆S, and therefore every fixed point has to be in the interior of ∆S. Parts (ii) and (iii) of this assumption are fairly standard. Under these additional conditions, we can obtain the following CLT: Theorem 4.2. Suppose that Assumptions 3.1-3.3 and 4.1 hold. Then ṀG and (Ip − Ψ̇G )−1 exist and are well defined, and √ d n (m̂n (θ) − E[m(Y1 ; θ)|F∞]) → Z1 + ṀG (Ip − Ψ̇G )−1 Z2 (4.1) mixing and uniformly in θ, where " # Z1 ∼N Z2 " 0 0 # " , Ωmm′ Ωmψ′ Ωψm′ Ωψψ′ #! This theorem is proven in the appendix. Note that despite the asymptotic conditional independence of players’ actions by Theorem 2.1, we need to correct the asymptotic variance for dependence after centering the statistic m̂n (θ) around its conditional limit. The crucial difference to the case in which the unobserved characteristics εi are private information lies in the necessity for adjusting the asymptotic variance of m̂n for endogeneity of Ĝn with respect to the actual realizations of the unobserved types. The correction term is different from the variance adjustment in Shang and Lee (2011)’s analysis of the private information P case, where the realized value of Ĝ∗n = n1 ni=1 si takes the role of a noisy measurement for its expected value in equilibrium. The result can also be easily adapted to settings in which the econometrician only observes a random subsample of N < n players in the game. As before, we let qi be an indicator variable that equals one if agent i is included in the sample, and zero otherwise, and q1 , . . . , qn GAMES WITH EXCHANGEABLE PLAYERS 27 are independent of Y1 , . . . , Yn . We can then define n 1 X m̃N (θ) := qi m(Yi ; θ) N i=1 and apply Theorem 4.2 to m̃N (θ) to obtain its asymptotic distribution. Corollary 4.4. Suppose the assumptions of Theorem 4.2 hold, and we observe a random sample of N ≤ n players’ type-action profiles, where Nn is bounded and limn Nn → α ∈ [0, 1]. Then √ √ d N (m̃N (θ) − E[m(Y1 ; θ)|F∞ ]) → Z1 + αṀG (Ip − Ψ̇G )−1 Z2 mixing and uniformly in θ, where Z1 and Z2 have the same properties as before. This result also helps highlight the different roles played by the number of players in the game and sample size in the asymptotic experiment. Specifically, the variance adjustment accounting for the equilibrium condition is important only if the size of the observed sample is of the same order of magnitude as the number of agents in the market. We now give another extension of the CLT to deal with moment functions of the form ∗ m (Yi ; θ, η) where the equilibrium distribution of actions G takes the role of the nuisance parameter introduced in Corollary 4.2. To be specific, we let the random variable η0 := G∗0 , which is F∞ -measurable, η̂n := Ĝn , and consider the sample moment n m̂∗n (θ, Ĝn ) 1X ∗ m (Yni ; θ, Ĝn ) := n i=1 We further assume that the derivative Ṁη := ∇η E[m∗ (Yi ; θ, η)|F∞] is well-defined a.s. and the assumptions of Theorem 4.2 hold. Then, noting that the total differential of r0 (µ) with respect to G is given by ṀG + Ṁη , the following extension of Theorem 4.2 is immediate from Corollary 4.2 and the proof of the original result: Corollary 4.5. Suppose the assumptions of Theorem 4.2 hold for moment functions of the form m∗ (Yi ; θ, η), where η ∈ ∆S, and we replace Assumption 4.1 (ii) with the requirement that the class Ms := {m((Xi′ , s)′ ; θ, η) : θ ∈ Θ, η ∈ ∆S} be Donsker with respect to the distribution of Xi for each s ∈ S with a square-integrable envelope function. Also assume that the derivative Ṁη := ∇η E[m∗ (Yi ; θ, η)|F∞] exists a.s. and is continuous for η ∈ ∆S. Then h i −1 √ ∗ d n m̂n (θ, Ĝn ) − E[m∗ (Y1 ; θ, G∗0 )|F∞ ] → Z1 + ṀG + Ṁη Ip − Ψ̇G Z2 mixing and uniformly in θ, where Z1 and Z2 have the same properties as before. 4.3. Variance Estimation. We now turn to estimation of the conditional asymptotic variance for m̂n (θ): Given a sample y1 , . . . , yn we can estimate the Jacobians ṀG and Ψ̇G either 28 KONRAD MENZEL ċ and Ψ ḃ . For parametrically given the distribution of ti , or nonparametrically to obtain M G G semi-parametric index models, this derivative can be estimated at a root-n rate in the presence of a continuous observed covariate with a nonzero coefficient, e.g. using Powell, Stock, and Stoker (1989)’s weighted average derivative estimator with constant weights (see their Corollary 4.1), or the estimator proposed by Horowitz and Härdle (1996) if only a finite number of markets are observed. We therefore state the following high-level assumption for consistent variance estimation: Assumption 4.2. (i) For every value of s ∈ S, the first two moments of |m((s, Xi′ )′ ; θ)| are ċ and Ψ ḃ bounded by a constant for all θ ∈ Θ, and (ii) there exist consistent estimators M G G for the Jacobians ṀG and Ψ̇G , respectively. For primitive conditions for part (ii) of this assumption, see e.g. Powell, Stock, and Stoker (1989) and Horowitz and Härdle (1996), where we can combine their consistency arguments with the conditional law of large numbers in Corollary 4.3. Note in particular that identification of the Jacobians using the arguments in Horowitz and Härdle (1996) requires that we observe at least two markets with different limiting values of the aggregate state, either due to differences in type distributions or equilibrium selection. We can now define ċ (I − Ψ ḃ )−1 [e − Ĝ ] v̂in (θ) := [m(yi ; θ) − m̂n (θ)] + M G p G si n (4.2) where ek ∈ {0, 1}p−1 is the kth unit vector if k ≤ p − 1 and the p − 1 dimensional null vector if k = p. A consistent estimator for the asymptotic variance is n V̂n (θ) := 1X v̂i (θ)v̂i (θ)′ n i=1 We also let A := [Iq , (I − Ψ̇′G )−1 ṀG′ ]′ . Our findings are summarized in the following corollary. Corollary 4.6. Suppose the conditions of Theorem 4.2 and furthermore Assumption 4.2 hold. Then V̂n (θ) → A(θ)′ Ω(θ)A(θ) a.s. Furthermore, √ d nV̂n (θ)−1/2 (m̂n (θ) − E[m(Y1 ; θ)|F∞]) → N(0, Iq ) (mixing) See the appendix for a proof. Variance estimation for statistics based on a random subsample of agents as considered in Corollary 4.4 is completely analogous where vin (θ) is replaced with r N ċ ḃ )−1 [e − Ĝ ] M G (Ip − Ψ v̂iN (θ) := [m(yi ; θ) − m̂N (θ)] + G si N n P and ĜN := N1 ni=1 qi esi is the empirical distribution of actions in the observed subsample. GAMES WITH EXCHANGEABLE PLAYERS 29 5. Simulation Study For ease of exposition, we restrict attention to the case of binary choice with social interactions, so that the aggregate state G is a scalar. In order to obtain a parsimonious simulation design that generates multiple equilibria, we model types as including both discrete and continuous components. To be specific, xi = (x1i , x2i ), where the characteristic x1i is a discrete variable which takes values −k or k with probability p each, and zero with probability 1 − 2p, and we set k = 5, p = 0.2, and σx2 = σε2 = 1. Furthermore, the market is partitioned into subgroups of agents, where we allow the strength of social interactions to be of different strength between individuals in the same group than across groups. The component x2i is multinomial with g equally likely categories and serves as an identifier for these subgroups. Finally, there is a characteristic εi ∼ N(0, σε2 ) that is i.i.d. across agents and independent of xi . The component εi is not observed by the econometrician, but the full types ti = (x′i , εi )′ are common knowledge among players. We analyze two different setups, one in which interactions are anonymous, ! n X 1 sj + ε i (5.1) u(ti ; s−i ) := si x′i β + ∆0 n j=1 with a global interaction effect ∆0 . Our second setup allows for a global interaction effect and a “local”, group-level interaction effect, with payoff functions ! n n X X 1 1 u(ti ; s−i ) := si xi + ∆0 sj + ∆1 g sj 1l{x2j = x2i } + εi (5.2) n j=1 n j=1 Here ∆1 measures the group-level interaction effect. For the simulations, we chose parameter values ∆0 = 8, ∆1 = 2, β = 1. Note that since ∆0 and ∆1 are nonnegative, the game is one with strategic complementarities (see Milgrom and Roberts (1990)), and we can use a tâtonnement algorithm to find the smallest and the largest equilibria (which are both in pure strategies) for any size of the market. Since actions are binary, we can implement an algorithm that needs at most n steps. We first illustrate convergence of the best response correspondence underlying the basic argument for conditional convergence for aggregate games in section 3. It is easy to verify that due to the positive sign of the interaction effect ∆0 , ψn (G, t) is single-valued for all finite n and values of G and t. Figure 5 shows the average response correspondence Ψ̂n (G) for a single realization of a market with n = 5, 20 and 100 players, respectively together with the limiting function Ψ0 (G). The limiting best response mapping Φ0 (G) has exactly three fixed points, and for this particular simulation draw, each finite-player version of the game also happens to have exactly three equilibrium values for Ĝn . In general there may be multiple KONRAD MENZEL 1 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 b 20 Ψ b5 Ψ 30 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0.6 0.8 1 G 1 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 Ψ0 b 100 Ψ G 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 0 0.2 0.4 0.6 G 0.8 1 0 0.2 0.4 G Figure 3. Ψ̂n (G) for n = 5, 20, 100, and Ψ0 (G) (bottom right) equilibria supporting the same value of Ĝn , and the probability that the number of distinct equilibrium distributions in the finite player game coincides with the number of fixed points of Φ0 (G) is strictly less than one. We now turn to simulations of statistics of the type analyzed in section 4 and analyze the quality of the asymptotic approximations. The Jacobians in the expression for the variance in equation (4.1) can be obtained if we have an estimator for the conditional marginal effects given Xi , q(x, G) := ∇G E [ψ0 (G; Ti )|Xi = x]. If the conditional distribution of εi given Xi is continuous and fixed with respect to Xi with p.d.f. hε (z), the conditional model for ψ0 (G; Ti ) given Xi has the linear index structure considered by Powell, Stock, and Stoker (1989) and Horowitz and Härdle (1996), and we can obtain consistent estimators for the index ˆ n and β̂n , and for the conditional p.d.f. ĥn (·). We can then form q̂n (x, G) := coefficients ∆ ˆ ĥε (x′ β̂n + ∆ ˆ n G). Since in our setup observable types are discrete and the number of ∆ markets is finite, the conditional marginal effects are in general not point-identified without parametric assumptions on the conditional distribution of εi given Xi . We therefore use GAMES WITH EXCHANGEABLE PLAYERS 31 the usual estimator for q(x, G) based on the Probit specification, q̂(x, G) := ′ parametric ˆ nG ˆn x β̂n +∆ ∆ , where ϕ(·) is the standard normal p.d.f.. ϕ σ σ For the binary action case, let d(x; θ) := m((1, x′ )′ ; θ)−m((0, x′ )′ ; θ) so that we can express ṀG = E [d(X; θ)q(X, G∗0 )|G∗0 ] and Ψ̇G = E[q(X, G∗0 )|G∗0 ] We can then obtain estimates of the Jacobians by replacing q(x, G) with its estimator q̂(x, G), G∗0 with Ĝn , and expectations with sample averages over the observed values of Xi , n ċ = 1 X d(X ; θ)q̂(X , Ĝ ) M i i n G n i=1 n ḃ = 1 X q̂(X , Ĝ ) and Ψ G i n n i=1 Given a consistent estimator q̂(x, G) for the conditional marginal effect, we can plug these estimators into the expression for v̂in in equation (4.2) and obtain a consistent estimator for the variance matrix of m̂n (θ). Below we report simulation results for the value of the aggregate state Ĝn and the maxiˆ n for the interaction effect ∆0 based on the limiting distribution. mum likelihood estimator ∆ Note that this estimator has a higher-order bias of order O(n−1 ) which becomes dominated by the (asymptotically Gaussian) first-order term in the asymptotic expansion. We also report simulation results for the Jackknife MLE which removes part of the leading bias term, where we use a parametric bootstrap bias correction for the standard errors that is asymptotically unbiased to second order. For results on the distribution of Ĝn , a given market picks the lowest equilibrium with probability 0.3, and the highest equilibrium with probability 0.7. The MLE is computed for samples of m = 16 markets with n players each, where in half the markets the lowest equilibrium is chosen, and in the remaining games players coordinate on the highest equilibrium. Note that for some realizations of types, especially for small values of n, the equilibrium is unique so that the lowest and the highest equilibria coincide. Figure 5 shows the p.m.f. for Ĝn for games of different sizes. Note that the two extremal equilibria for games with strategic complementarities are always in pure strategies, so that in our simulation design, the distribution of Ĝn is always discrete with values n1 , n2 , . . . , 1, regardless whether types are discrete or continuous. The narrow spikes and gaps in the histograms for Ĝn are “interference patterns” that are caused by the discreteness of the variable and not a result of simulation error. These spikes average out between neighboring values for Ĝn and appear to vanish for larger values of n. Figures 5 and 5 show kernel density ˆ n and the corresponding t-ratios. approximations to the simulated p.d.f.s for Ĝn and ∆ Finally, we simulate versions of the model in which agents also interact “locally.” In this setting, each agent in each market belongs to one of g different groups and has preferences given by (5.2). The interaction effects are held at ∆0 = 8 and ∆1 = 2, respectively, and our simulation experiments consider the cases of g = 4 and g = 10 subgroups. We consider a “fixed design” with respect to the marginal distribution of x2i in each market in that 32 KONRAD MENZEL 0.14 0.12 0.25 0.1 density density 0.2 0.15 0.08 0.06 0.1 0.04 0.05 0.02 0 0.1 0.2 0.3 0.4 0.5 Ĝn 0.6 0.7 0.8 0.9 0 1 0.09 0.1 0.2 0.3 0.4 0.5 Ĝn 0.6 0.7 0.8 0.9 1 0.06 0.08 0.05 0.07 0.04 density density 0.06 0.05 0.04 0.03 0.03 0.02 0.02 0.01 0.01 0 0 0.2 0.4 0.6 0.8 0 1 0 0.2 0.4 Ĝn 0.6 0.8 1 Ĝn Figure 4. Distribution of Ĝn for n = 10, 20, 50, and n = 100, respectively n=10 n=20 n=50 n=100 5.5 5 0.35 4.5 4 0.3 density 3.5 density n=10 n=20 n=50 n=100 Gaussian 0.4 3 2.5 2 0.25 0.2 0.15 1.5 0.1 1 0.05 0.5 0.1 0.2 0.3 0.4 0.5 Ĝn 0.6 0.7 0.8 0.9 1 −4 −3 −2 −1 0 Ẑn 1 2 3 Figure 5. Simulated p.d.f. of Ĝn (left panel), and its studentization Ẑn = √ Ĝn −G∗0 (right panel) for n = 10, 20, 50, 100. n√ V̂G,n 4 5 GAMES WITH EXCHANGEABLE PLAYERS 33 0.4 0.7 n=50 n=100 n=500 Gaussian 0.6 n=50 n=100 n=500 Gaussian 0.35 0.3 0.5 density density 0.25 0.4 0.2 0.3 0.15 0.2 0.1 0.1 0 0.05 −4 −3 −2 −1 0 Ẑn 1 2 3 4 5 −3 −2 −1 0 Ẑn 1 2 3 4 Figure 6. Simulated p.d.f. of the studentized MLE for ∆0 (left) from m = 12 markets with n = 50, 100, 500 players, and the (Jackknife) bias-corrected MLE (right) the proportion of players belonging to each group is held constant across replications of the game.11 Fixing the marginal distribution for x2i allows us to reduce the relevant dimension of the aggregate G from 2g−1 to g which speeds up the matrix inversions involved in computing the asymptotic conditional variance of Ĝn . The results are reported in Figure 5, and there appears to be a visible deterioration of the performance of asymptotic approximation as we increase g. 6. Discussion This paper establishes methods for conditional estimation and inference using data from games with a large number of players. The argument relies crucially on the idea of exchangeability of individual agents, and addresses difficulties with stochastic dependence arising from strategic behavior and multiplicity of equilibria. We also argue that distributions arising in the many-player limit may also be computationally more tractable. We discussed how to apply the asymptotic results to static aggregate games with discrete actions. It is possible to extend some of the main ideas of this paper dynamic games, matching markets, network formation and other problems of non-anonymous strategic interaction. However, this is beyond the scope of this study and will be left for future research. Appendix A. Proofs for Results from Section 3 A.1. Proof of Proposition 3.1. We start by establishing claim (a), where the proof proceeds in the following steps: we first construct a convex-valued correspondence Ψ̃n (G) and show uniform convergence in probability of the convex hull of Ψ̂n (G) to Ψ̃n (G). We then 11Note that the empirical distribution of types is also Fn -measurable, so that conditioning on type distributions, rather than individual realizations, preserves exchangeability of type-action characters. 34 KONRAD MENZEL 12 n=50 n=100 n=200 n=500 10 n=50 n=100 n=200 n=500 Gaussian 0.35 0.3 0.25 density density 8 6 0.2 0.15 4 0.1 2 0 0.05 0.1 0.2 0.3 0.4 0.5 Ĝn 0.6 0.7 0.8 0.9 1 −4 −3 −2 −1 0 Ẑn 1 2 3 4 5 −2 −1 0 Ẑn 1 2 3 4 5 0.4 n=50 n=100 n=200 n=500 n=50 n=100 n=200 n=500 Gaussian 0.35 10 0.3 8 0.25 density density 12 6 0.2 0.15 4 0.1 2 0 0.05 0.1 0.2 0.3 0.4 0.5 Ĝn 0.6 0.7 0.8 0.9 1 −4 −3 Figure 7. Simulated p.d.f. of Ĝn (left panel), and its studentization Ẑn = √ Ĝn −G∗0 (right panel) with g = 4 (top panels) and g = 10 (bottom) interacn√ V̂G,n tion groups for n = 50, 100, 200, 500. show graphical convergence of the correspondence Ψ̃n (G) to Ψ0 (G). Finally, we conclude that any sequence of solutions Ĝ∗n of the fixed point inclusion Ĝ∗n ∈ Ψ̂n (Ĝ∗n ) approaches the set G0∗ with probability one. Recall that the correspondence n 1M Ψ̂n (G) := ψn (G, ti ) n i=1 where “ L ” denotes the Minkowski sum over the sets ψ̃n (s; G, ti ) for i = 1, . . . , n. Also let Ψ̃n (G) := E [ψn (G, Ti )] where the operator E[·] denotes the (Aumann) selection expectation with respect to the random type Ti ∼ H0 . Note that from standard properties of Aumann expectations, Ψ̃n (G) is convex at every value G ∈ ∆S. GAMES WITH EXCHANGEABLE PLAYERS 35 Uniform Convergence. We will now show uniform convergence of the correspondence Ψ̂n to Ψ̃n in probability. In the following, we denote the support function of the set Ψ̂n (G) by ĥn (v; G, ε) := inf G′ ∈Ψ̂n (G) hv, G′i for any v on the (p − 1) dimensional unit sphere in Rp , and we also let h̃n (v; G, ε) := inf G′ ∈Ψ̃ n (G) hv, G′i Since for the Minkowski sum of two convex sets K1 and K2 , the support function satisfies h(a1 K1 ⊕ b2 K2 , v) = a1 h(K1 , v) + b2 h(K2 , v), we can rewrite n 1X ̺n (v; G, ti ) ĥn (v; G) = n i=1 where at a given value of G and type t, ̺n (v; G, t) := inf G′ ∈ψn (G,t) hv, G′ i = inf G′ ∈conv ψn (G,t) hv, G′i is the support function of the set ψn (G, t). In order to show that ĥn converges to h̃n uniformly, i.e. p sup ĥn (v; G) − h̃n (v; G) → 0 v,G note first that the sets conv ψn (G, ti ) are convex polytopes whose vertices are equal to one or zero, yielding at most 2p different sets for conv ψn (G, ti ; ε). Hence we can represent the support function ̺n (v; G, t) as a simple function defined on sets that are p fold intersections of the sets A(s, G) and their complements. Since the VC property is preserved under complements and finite intersections, Assumption 3.1 implies that ̺n (v; G, t) is a VC class of functions indexed by (v, G) for every n with a VC index that does not depend of n. Now by the same argument as in the proof of Theorem 3.1.21 in Molchanov (2005), convergence of the support functions in probability implies convergence in probability of P the set conv n1 ni=1 ψn (G, ti ) with respect to the Hausdorff metric, where convergence is uniform in G by boundedness of ̺n (v; G, t) and the VC property of the support functions for ψn (G, ti ). Graphical Convergence. Now fix δ > 0. In order to establish that Ψ̃n (G) = E [ψn (G, t)] converges to Ψ0 (G) = E [ψ0 (G, t)] graphically, we show that for n large enough the two δ inclusions gphΨ0 ⊂ gphΨ̃n and gphΨ̃n ⊂ (gphΨ0 )δ hold: First, consider a point G, Ψ such that Ψ ∈ Ψ0 (G), where Ψ = (Ψ(s(1) ), . . . , Ψ(s(p) )) ∈ ∆S. From the definition of G it follows that P (t ∈ A(s, G)) ≥ Ψ(s) and P (t ∈ / A(s, G)) ≥ 1−Ψ(s) for all s. Now consider the extremal points Ψ∗p of Ψ0 (G1 ) of the form Ψ∗p (s(p) ) = P (t ∈ P A(s(p) , G1 )) for p = 1, . . . , S −1, and s′ 6=s(p) Ψ∗p (s′ ) = 1−P (t ∈ A(s(p) , G1 )). It is immediate that the values Ψ ∈ Ψ0 (G1 ) are convex combinations of Ψ∗p , p = 1, . . . , S − 1. 36 KONRAD MENZEL Now let B be the closed δ/2 ball around G1 . Then for all sufficiently large n, the δ/2 blowup S of the set G∈B conv (Eψn (G, t)) includes the extremal points of Ψ0 (G1 ) and their convex hull. In particular, we can find a point G̃1 ∈ B and Ψ̃1 ∈ Ψ̃n (Gn ) such that d(Ψ̃1 , Ψ1 ) < δ/2. Hence for large n we can find (G̃1 , Ψ̃1 ) ∈ gph Ψ̃n such that d((G1, Ψ1 ), (G̃1 , Ψ̃1 )) < δ. Since δ this bound does not depend on G, Ψ, it follows that gphΨ0 ⊂ gphΨ̃n . The argument for the second inclusion is completely analogous, so that we can conclude that for n large enough, dH (gphΨ0 , gphΨ̃n ) < δ. Convergence to Limiting Points. Given δ > 0 we define η := inf {dH (G, Ψ0 (G)) : G ∈ ∆S : d(G, G0∗ ) ≥ δ} By Lemma 3.2, the number of elements in G0∗ is finite, so that since ∆S is compact and dH (G, Ψ0 (G)) is lower-semi-continuous in G, we have that η > 0, where existence of the minimum follows e.g. by Theorem 1.9 in Rockafellar and Wets (1998). Combining uniform convergence of Ψ̂n to Ψ̃n in probability, and graphical convergence of Ψ̃n to Ψ0 via the triangle inequality, we can choose n large enough such that the probability of dH (gphΨ̂n , gphΨ0 ) < min{δ, η} is arbitrarily close to one. Hence the probability that there exists a solution of the inclusion G̃ ∈ Ψ̂n (G̃) with d(G̃, G0 ) > δ converges to zero as n increases. Finally, since any Ĝn solving Ĝn ∈ Ψ̂n (Ĝn ) also solves Ĝn ∈ Ψ̃n (Ĝn ) for n large enough, we have that d(Ĝn , G0 ) < δ with probability approaching one, which establishes assertion (a). Achievability of Limiting Equilibria. We now turn to the proof of part (b). We proceed by showing that for every equilibrium point G∗0 ∈ G0∗ , we can find a neighborhood of G∗0 such that Ψ0 (G) is an inward map on that neighborhood. Then we show that by graphical convergence, Ψ̂n (G) is also an inward map on all such neighborhoods with probability approaching one. We then apply a fixed-point theorem for inward maps to a transformation of the problem and conclude that with probability approaching 1, local solutions exist near every G∗0 ∈ G0∗ . Construction of Neighborhoods. Let Γ0 (G) := Ψ0 (G) − G be defined as in the main text. In the following, we say that Ψ0 is a (possibly multi-valued) inward map on a convex set K ⊂ ∆S, if for all G ∈ K, Ψ(G) ∩ (G + TK (G)) 6= ∅, where TK (G) is the tangent cone of K at G. We will now show that for every G∗0,j ∈ G0∗ , j = 1, . . . , |G0∗ |, and neighborhood U(G∗0,j ) of G∗0,j , we can find a compact convex set Kj ⊂ U(G∗0,j ) containing an open set around G∗0,j , and such that Ψ0 restricted to Kj is an inward map. To this end suppose without loss of GAMES WITH EXCHANGEABLE PLAYERS 37 generality that for that every vector u 6= 0 and v ∈ DΓ0 (G∗0,j , G∗0,j )(u), we have that the inner product u′ v < 0.12 We will prove this claim by contradiction: Suppose that for a sequence hn → 0 we can construct a sequence of closed balls Kn of radius hn around the fixed point G∗0,j such that for every n there exists Gn on the boundary of Kn such that Ψ0 (Gn ) ∩ (Gn + TKn (Gn )) = ∅. In particular, Ψ0 (Gn ) − Gn ⊂ [TKn (Gn )]C , where the superscript C denotes the complement of a set, and without loss of generality we will take Ψ0 (Gn ) to be single-valued. Now, let Ψ̃0n (G) denote the least-squares projection of Ψ0 (G) onto Knhn , the hn -blowup of C C Kn - note that if Ψ0 (G) − G ⊂ [TKn (G)] , then also Ψ̃0n (G) − G⊂ [TKn (G)] . Now consider Gn − G∗0,j , Ψ̃0n (Gn ) − Gn in Kn × Knhn . Since this the sequence of vectors vn := h−1 n sequence is contained in the closed ball of radius 2 - a compact set - there exists a converging subsequence vn(ν) , ν = 1, 2, . . . and n(ν) → ∞, where limν→∞ vn(ν) =: v ≡ (vG , vΨ ). Since 0 ∈ Ψ̃0n (G∗0,j ) − G∗0,j by assumption, v is also an element of the graph of the contingent derivative of Ψ0 (G) − G at (G∗ , G∗ ). Furthermore, Kn is convex, so that G∗0,j − Gn ∈ TK (Gn ). Since Kn is a ball, at every point Gn on the boundary of Kn TKn (Gn ) is an open half-space, implying that [TKn (Gn )]C = −int(TKn (Gn )), where int(A) denotes the interior of a set A. Now, by construction Gn − Ψ̃0n (Gn ) ∈ [TK (Gn )]C , so that the inner product (Gn − G∗0,j )′ (Ψ̃0n (Gn ) − Gn ) > 0 for all n. ′ Hence, taking limits we have vG vΨ > 0. However, this contradicts that for the contingent ∗ ∗ derivative of Ψ0 at (G0,j , G0,j ) we had u′ v < 0 whenever v ∈ DΓ0 (G∗0,j , G∗0,j )(u), so that Ψ0 has to be inward as claimed before. Since Kn is compact, and Ψ0 (G)−G is continuous, we have that the length of the projection of Ψ0 (G) − G onto TKn (G) is bounded away from zero as G varies over Kn . Noting that the convex hull of Ψ̂n (G) is lower semi-continuous and converges to Ψ0 (G) graphically in probability, we have that Ψ̂n is also inward on K1 , . . . , K|G0∗ | with probability approaching one. Local Existence of Fixed Points of Ψ̂n . In order to remove the non-convexities of the mapping Ψ̂n (G), we consider a transformation of the graph of Ψ̂n on the set Kj under a continuous one-to-one mapping Hj : Kj × Kj → G × G, [G, Ψ] 7→ [Hj (G, Ψ − G), Ψ]. We can choose the mapping Hj such that Hj (G, Ψ − G) is strictly monotone in its second argument, and 12More specifically, we can choose a diagonal matrix B with elements Bjj ∈ {−1, 1}, j = 1, . . . , p − 1, and rewrite the inclusion as G∗0,j − B(G − G∗0,j ) ∈ Ψ0 (G∗0,j − B(G − G∗0,j )) for any such B, or equivalently, G = G∗0,j + (G − G∗0,j ) ∈ Ψ0 (G∗0,j − B(G − G∗0,j )) + (I − B)(G − G∗0,j ) =: Ψ̃B (G) Now if we let ΓB (G) := Ψ̃B (G) − G, by Assumption 3.2 (ii) the contingent derivative of ΓB is surjective, and we can choose B such that there exists a vector vB ∈ gph DΓB (G∗0,j , G∗0,j )(u), the inner product u′ · vB ≤ 0. The second part of Assumption 3.2 (ii) implies that this inequality is strict, and has the same sign for any ′ other vB ∈ gph DΓB (G∗0,j , G∗0,j )(u) by convexity of the contingent derivative. 38 KONRAD MENZEL Hj (G, 0) = G, and the values of (Hj (G, Ψ̂n (G)), Ψ̂n (G)) are convex for every G ∈ ∆S. Note that the two conditions on Hj (·) imply that G is a fixed point of Ψ̂n if and only if it is also a fixed point of (Hj (G, Ψ̂n − G), (Ψ̂n (G)). Furthermore, since the transformations Hj map any point on the boundary of Kj onto itself, they also preserve the inward mapping property of Ψ̂n . Since the transformed mapping is nonempty with closed and convex values, upper semicontinuous, and inward on Kj , existence of a fixed point in Kj follows from Theorem 3.2.5 in Aubin and Frankowska (1990). Since the number of sets K1 , . . . , Ks is finite, and the set of fixed points of the transformed mapping is equal to that of Ψ̂n , it follows that with probability approaching one, Ψ̂n has a fixed point in the neighborhood of each equilibrium point G∗0,j ∈ G0∗ , establishing the conclusion in part (b) A.2. Proof of Lemma 3.3. We first consider case (a). Let t̃ := (t̃1 , t̃2 ) ∈ T1 ×T2 such that a player of type t̃ is indifferent between two or more actions given the aggregate state G. Since by Assumption 3.3, the type distribution is tight, for a fixed δ > 0, we can find a compact subset K ⊂ T1 such that t1i ∈ K with conditional probability greater than 1−δ given t2i = t̃2 . Since by Assumption 3.3 (ii), the Jacobian ∇t1 u(ti , G) has rank equal to p − 1, there exists a neighborhood N(t̃1 ) of t̃1 such that for players of any type t = (t1 , t̃2 ), t1 ∈ N(t̃1 ) \ {t̃1 } there is a unique best response give the aggregate value G. Without loss of generality we can take the family of neighborhoods N(t̃1 ) to be a cover of K. Since K is compact, there is a finite subcover, and since the N(t̃1 ) were constructed in a way such that each set contains at most one value t1 such that agents with type (t′1 , t̃′2 )′ are indifferent between any two actions, there exist only finitely many values of t1 ∈ K such that u((t′1 , t̃2 )′ , s, G) = u((t′1 , t̃2 )′ , s′ , G) for some s 6= s′ . Since δ can be chosen arbitrarily small, by the law of total probability there cannot be a strictly positive mass of types that are indifferent between any two actions at G. Hence with probability 1, ψ0 (ti ; G) is a singleton, and the correspondence Ψ0 (G) is in fact single-valued. For case (b), it is sufficient to notice that due to strict monotonicity of the payoff differences u(s(l) , t, G) − u(s(k) , t, G), for every value of t̃2 there is at most one value of t1 such that a player of type (t1 , t̃′2 )′ is indifferent between actions s(k) and s(l) . Since the number of actions is finite, the conditional probability of a tie given t2 = t̃2 is zero, and the conclusion follows from the law of total probability Appendix B. Proofs for Results from Section 4 B.1. Proof of Corollary 4.2. Note that Condition 4.1 does not depend on the form of the moment function, and the theorem already includes a strengthened version of Condition 4.3 that holds uniformly in η. Hence it only remains to be shown that 4.2 (iii) holds for m∗ (Yi ; θ, η̂n ): Since m∗ ((s, Xi ); θ, η) is continuous in η for all s, we have that GAMES WITH EXCHANGEABLE PLAYERS 39 m∗ ((Yni , Xi ); θ, η) is also continuous, and by the continuous mapping theorem, we can find a null sequence dn → 0 such that |E[m∗ (Yni ; θ, η̂n ) − m∗ (Yni ; θ, η0 )|Fn ]| < dn a.s.. Hence, condition 4.2 (iii) for m∗ (Yi ; θ, η̂n ) follows from the corresponding restriction on m(Yi ; θ) := m∗ (Yi ; θ, η0 ) and the triangle inequality. Finally, note that since η0 is F∞ -measurable, it is also Fn -measurable. Hence n Zn∗ n 1X ∗ 1X := m (Yni ; θ, η̂n ) + [m(Yi ; θ, η0 ) − m∗ (Yni ; θ, η̂n )] n i=1 n i=1 is also a reverse martingale adapted to {Fn }n≥1 and satisfying the Conditions of Theorem 4.1, so that the claim of the corollary follows from the same steps as before B.2. Proof of Proposition 4.1. Let ν1 , ν2 , . . . be a sequence of i.i.d.n draws from the uni- o P form distribution on the (p−1)-dimensional probability simplex ∆S := π ∈ Rp+ : pq=1 πq = 1 , where p is the number of n pure actions available to each player. Then (ν1′ ,o. . . , νn′ )′ is uniformly n(p−1) Pp distributed on (∆S)n = π ∈ R+ : q=1 πiq = 1 for all i = 1, . . . , n . Now fix t ∈ T n and let λ∗ (t) be the distribution over equilibria in Σ∗ (t) that generates the mixture fN∗ (s, x|θ). Without loss of generality we can assume that λ∗ (t) is symmetric with respect to permutations among players with the same type ti . Then λn constitutes a partition of (∆S)n that can be chosen to be symmetric with respect to the same permutations of indices 1, . . . , n, and therefore satisfies Condition 4.2 (i).13 In particular, the resulting distribution of type-action characters, f (s, x|θ, λN ) = fN∗ (s, x|θ) for all s ∈ S. We now prove parts (ii) and (iii) of Condition 4.2. By Proposition 3.1, we have that the a.s. equilibrium value of the aggregate d(Ĝ∗n , G0∗ ) → 0, where G0∗ is a finite set. We now show that in order to characterize the the set of distributions of Y1 it is sufficient to consider a family f (y1 |θ, µ), where the parameter space for µ is the |G0∗ |-dimensional probability simplex. More specifically, fix n0 , ν1 , . . . , νn0 and t1 , . . . , tn0 , and let G̃∗ := arg minG∈G0∗ d(G, Ĝn ), where Ĝn is the value of the equilibrium aggregate under the selection rule λn . Now fix ε > 0. By Proposition 3.1 (b), there exists n0 large enough such that with probability 1 − ε we can find an equilibrium that supports some Ĝ∗n for all n ≥ n0 such that d(Ĝ∗n , G̃∗ ) < rn for a sequence rn → 0. Hence we can choose λn such that for the first n0 coordinates of ν1 , . . . , νn and t1 , . . . , tn coinciding with ν1 , . . . , νn0 and t1 , . . . , tn0 , respectively, we have Ĝn = Ĝ∗n . 13Note that we can identify the vertices of (∆S)n and elements of the partition containing them with profiles in S by setting si = q if νiq = 1 for all i = 1, . . . , n and q = 1, . . . , p. Denoting ιn(p−1) = (1, 1, . . . , 1)′ an n(p − 1)-dimensional vector of ones, we can now construct a partition by considering sections around the central point π ∗ := p1 ιn(p−1) that are symmetric with respect to any permutation of individuals of the same type, and such that for every strategy profile s ∈ S n the set containing s̄ has volume λ∗ (s̄) (possibly zero). Finding such a partition is a finite-dimensional convex program with linear constraints so that existence and uniqueness of the solution are not problematic. n 40 KONRAD MENZEL Now let M := supθ∈Θ maxs,s′∈S |E[m((s, x); θ) − m((s′ , x); θ)]|, which is finite by Condition 4.3. Now we can bound E [ |m(Yn1 ; θ) − m(Y1 ; θ)|| Fn ] ≤ MP (Sn1 6= S1 |Fn ) Now notice that by Assumption 3.3 and Lemma 3.3, the set ψ0 (G̃∗ , ti ) is the singleton {Si } with probability one. Therefore since t1 , . . . , tn are i.i.d. draws from H0 (t), we have n o 1X n 1l ψ0 (G̃∗ , ti ) 6= ψ0 (Ĝ∗n , ti ) ≤ 2P (ψ0 (G̃∗ , t1 ) 6= ψ0 (Ĝ∗n , t1 )) P (Sn1 6= S1 |Fn ) = n i=1 say, w.p.a.1 by the strong law of large numbers. Now notice that by Assumption 3.3, P (ψ0 (G̃∗ , ti ) 6= ψ0 (G, ti )) is continuous in G so that, since Ĝ∗n → G̃∗ , we can make P (ψ0 (G̃∗ , ti ) 6= ψ0 (Ĝ∗n , ti )) arbitrarily small by choosing n large enough, establishing Condition 4.2 part (iii) B.3. Proof of Theorem 4.2. First, note that under Assumption 3.3, Ψ0 (G) is single-valued at every point G ∈ ∆S by Lemma 3.3. Hence, the Aumann selection expectation defining Ψ0 (G) coincides with the usual expectation of a vector-valued random variable, and under Assumption 3.3 (i) the mapping G 7→ G − Ψ0 (G) is Fréchet differentiable with continuous derivative Ip−1 − ∇G P (ψ0 (s, ti ; G) = {1}). Similarly, by Assumption 3.3 (iii), the derivative of the conditional probability ṖG (Xi ) := ∇G P ( ψ0 (s, ti ; G0 ) = {1}| Xi ) exists and is continuous in G a.s.. Hence we can interchange differentiation and integration, so that by the law of iterated expectations i X h Ṁ0 (G) := ∇G E[m0 (ti , G; θ)] := E m((Xi′ , s)′ ; θ)ṖG (Xi ) s∈S Since by Assumption 4.1 (ii), the functions m((Xi′ , s)′ ; θ) are dominated by a square integrable envelope for all s ∈ S, we can use the Cauchy-Schwarz inequality to establish that the elements of ∇G E[m0 (ti , G; θ)] are bounded and continuous in G in a neighborhood of G∗0 . We will now verify the conditions for Theorem 3.3.1 in van der Vaart and Wellner (1996) √ to derive the asymptotic distribution of n(µ̂ − µ0 ). Note that for notational simplicity, we suppress the θ argument below whenever θ can be regarded as fixed. We can decompose the difference between the maps r̂n (µ) := r̂n (θ, µ) and r0 (µ) := r0 (θ, µ) by ! n n M X √ 1 [rn (ti ; µ0 ) − r(ti ; µ0 )] + [r(ti ; µ0) − E [r(ti ; µ0 )|F∞ ]] n(r̂n (µ0 ) − r0 (µ0 )) = √ n i=1 i=1 =: T1 + T2 GAMES WITH EXCHANGEABLE PLAYERS 41 We first show that the first term is asymptotically negligible: let rn∗ (ti ; µ0 ) be any mea surable selection of rn (ti ; µ0 ). If we let Bn := G′ ∈ ∆S : kG0 − G′ k ≤ n1 and πn (µ0 ) := S P ψ0 (ti ; G0 ) 6= G′ ∈Bn ψ0∗ (ti ; G′ ) , then we can bound Var (rn∗ (ti ; µ0 ) − r(ti ; µ0 ))2 |Fn ≤ πn (µ0 )(1 − πn (µ0 )) Similarly, we can bound |E[rn∗ (ti ; µ0 ) − r(ti ; µ0)|F∞ ]| ≤ πn (µ0 ) Now notice that, as shown above Assumption 3.3 implies that at every value of G, there are no atoms of ”switchers” in the type distribution. Since the density of the type distribution is bounded from above, we therefore have that limn nπn (µ0 ) < ∞ for each µ0 . Since with probability one, µ0 only takes finitely many values, the common upper bound for all µ0 is finite for all F∞ -measurable events. Therefore the variance and absolute value of the L expectation of the first term T1 := √1n ni=1 [rn∗ (ti ; µ0 ) − r(ti ; µ0 )] converge to zero almost surely as n → ∞, so that applying Chebyshev’s inequality, T1 converges to zero in probability conditional on F∞ . Since ψ0 (ti , G∗0 ) is single-valued with probability 1, and ti are i.i.d. draws from the distribution h(t), we can apply a standard CLT and obtain n √ 1X d [r(ti ; µ0 ) − r0 (µ0 )] → N(0, Ω) n n i=1 (B.1) Since conditional on F∞ , T1 → 0 in probability, convergence in distribution with mixing follows from Slutsky’s theorem. Next, consider the Jacobian of the (augmented) population moments " # Iq −Ṁ0 (G∗0 ) ∇µ r0 (µ) = − 0 Ip−1 − Ψ̇0 (G∗0 ) By Assumption 3.2, the derivative Ip−1 − Ψ̇0 (G∗0 ) has full rank at every G∗0 ∈ G0∗ , and we already showed above that Ṁ0 (G) and Ψ̇0 (G) are continuous in G. Due to the block-diagonal structure it is therefore straightforward to verify that the inverse of ∇µ r0 (µ) is well-defined and continuous in µ in a neighborhood around µ0 and for all values of θ ∈ Θ. We will now check the conditions of Lemma 3.3.5 in van der Vaart and Wellner (1996). By Assumption 3.1 (iii), the class T := {ψ0 (ti ; G) : G ∈ ∆S} is a VC subgraph class, and therefore Donsker. Also, by Assumption 4.1, Ms := {m((x, s); θ), θ ∈ Θ} is a Donsker class for each s ∈ S, and S is finite. Note that the class R := {r(ti , µ, θ), µ ∈ Rq × ∆S, θ ∈ Θ} is a Lipschitz transformation of M1 , . . . , Mp and T with the same envelope function as M1 . Hence, by Theorem 2.10.6 in van der Vaart and Wellner (1996), Z is also Donsker with respect to the distribution of types. 42 KONRAD MENZEL Note also that as shown before, ψ0 (s, ti ; G) is single-valued at every G except for a set of types of measure zero. Since payoffs are continuous in t and G, and the p.d.f. of the type distribution is continuous in t, it follows that Ekr(ti ; µ) − r(ti ; µ0 )k2 is also continuous in µ. Finally, by Proposition 4.1, we can construct the coupling such that conditional on F∞ , Ĝn − G0 → 0 in outer probability, so that by Lemma 3.3.5 in van der Vaart and Wellner (1996) we have that for the empirical process √ kGn (r(·, µ) − r(·, µ0))k = o∗P 1 + nkµ − µ0 k Hence we can apply Theorem 3.3.1 in van der Vaart and Wellner (1996) to obtain the √ asymptotic distribution of n(µ̂n − µ0 ). It remains to show uniformity with respect to θ. Note that since R is Donsker as θ varies in Θ, convergence of r̂n (θ, µ) is also uniform in θ. Finally, since by Assumption 4.1 (ii), the class Ms has a square-integrable envelope function, by the same arguments as before the elements of the derivative ṀG are bounded so that the bounds in the previous arguments do not depend on θ B.4. Proof of Corollary 4.6. By the same reasoning as in the proof of Theorem 4.2, the matrices ṀG and (I − Ψ̇G )−1 are well-defined, so that by Assumption 4.2 (i), the first P two moments of |vi | are bounded. Therefore, Corollary 4.3 implies that n1 ni=1 vi vi′ → E[vi vi′ |F∞ ] a.s.. Next, the covariance matrix of (m(Yi ; θ)′ , e′si )′ is given by Ωn (Ĝn ) where Ωn (G) := Var(m((Xi′ , ψn∗ (ti ; G)′ )′ ; θ), ψn∗ (ti ; G)) for some selection ψn∗ (ti ; G) ∈ ψn (ti ; G). By similar arguments as in the proof of Theorem 4.2, Ωn (G) → Ω(G) uniformly in G. Note that Ω(G) is continuous in G and Ĝn → G∗0 a.s., so that Ω(Ĝn ) → Ω by the continuous mapping theorem. Since by Assumption 4.2, the estimated Jacobians converge to Ṁg and Ψ̇G , respectively, we therefore have that the conditional expectation of the product vi vi′ converges to A′ ΩA. By Assumption 4.2 (i) we therefore obtain V̂n → A′ ΩA almost surely, as claimed in the first claim of the Corollary. The second part follows immediately from Theorem 4.2 and Slutsky’s Lemma References Acemoglu, D., and M. Jensen (2009): “Aggregate Comparative Statics,” working paper, MIT and University of Birmingham. Al-Najjar, N. (2006): “Large Games and the Law of Large Numbers,” Games and Economic Behavior, 64, 1–34. Andrews, D. (2005): “Cross-Section Regression with Common Shocks,” Econometrica, 73(5), 1551–1585. Aubin, J., and H. Frankowska (1990): Set-Valued Analysis. Birkhäuser. GAMES WITH EXCHANGEABLE PLAYERS 43 Bajari, P., L. Benkard, and J. Levin (2007): “Estimating Dynamic Models of Imperfect Competition,” Econometrica, 75(5), 1331–1370. Bajari, P., J. Hahn, H. Hong, and G. Ridder (2011): “A Note on Semiparametric Estimation of Finite Mixtures of Discrete Choice Models with Application to Game Theoretic Models,” International Economic Review, 52(3), 807–824. Banerjee, A., E. Duflo, M. Ghatak, and J. Lafortune (2009): “Marry for What? Caste and Selection in Modern India,” working paper, MIT, LSE, and University of Maryland. Beresteanu, A., I. Molchanov, and F. Molinari (2009): “Sharp Identification Regions in Models with Convex Predictions: Games, Individual Choice, and Incomplete Data,” cemmap working paper CWP27/09. Bickel, P., A. Chen, and E. Levina (2011): “The Method of Moments and Degree Distributions for Network Models,” Annals of Statistics, 39(5), 2280–2301. Bickel, P., C. Klaassen, Y. Ritov, and J. Wellner (1993): Efficient and Adaptive Estimation for Semiparametric Models. Springer. Blum, J., H. Chernoff, M. Rosenblatt, and H. Teicher (1958): “Central Limit Theorems for Interchangeable Processes,” Canadian Journal of Mathematics, 10, 222–229. Bresnahan, T., and P. Reiss (1991): “Entry and Competition in Concentrated Markets,” Journal of Political Economy, 99(5), 977–1009. Brock, W., and S. Durlauf (2001): “Discrete Choice with Social Interactions,” Review of Economic Studies, 68. (2007): “Identification of Binary Choice Models with Social Interactions,” Journal of Econometrics, 140. Ciliberto, F., and E. Tamer (2009): “Market Structure and Multiple Equilibria in Airline Markets,” Econometrica, 77(6), 1791–1828. Diaconis, P., and S. Janson (2008): “Graph Limits and Exchangeable Random Graphs,” Rendiconti di Matematica (7), 28, 33–61. Dudley, R. (2002): Real Analysis and Probability. Cambridge University Press, Cambridge. Galichon, A., and M. Henry (2011): “Set Identification in Models with Multiple Equilibria,” Review of Economic Studies, forthcoming. Glaeser, E., B. Sacerdote, and J. Scheinkman (1996): “Crime and Social Interactions,” Quarterly Journal of Economics, 111, 507–548. Graham, B. (2008): “Identifying social interactions through conditional variance restrictions,” Econometrica, pp. 643–660. Grieco, P. (2012): “Discrete Games with Flexible information Structures: An Application to Local Grocery Markets,” working paper, Penn State. 44 KONRAD MENZEL Hall, P., and C. Heyde (1980): Martingale Limit Theory and its Application. Academic Press, New York. Harsanyi, J. (1973): “Oddness of the Number of Equilibrium Points: A New Proof,” International Journal of Game Theory, 2(1), 235–250. Hausman, J. (1983): Specification and Estimation of Simultaneous Equations Modelspp. 391–448. in: Grilliches and Intrilligator (eds.): Handbook of Econometrics, Vol. I. Hildenbrand, W. (1971): “Random Preferences and Equilibrium Analysis,” Journal of Economic Theory, 3, 414–429. Horowitz, J., and W. Härdle (1996): “Direct Semiparametric Estiamtion of SingleIndex Models with Discrete Covariates,” Journal of the American Statistical Association, 91, No. 436, 1632–1640. Horst, U., and J. Scheinkman (2006): “Equilibria in Systems of Social Interactions,” Journal of Economic Theory. (2009): “A Limit Theorem for Systems of Social Interactions,” Journal of Mathematical Economics. Kalai, E. (2004): “Large Robust Games,” Econometrica, 72(6), 1631–1665. Kallenberg, O. (2005): Probabilistic Symmetries and Invariance Principles. Springer. Kingman, J. (1978): “Uses of Exchangeability,” Annals of Probability, 6(2), 183–197. Liggett, T. (2005): Interacting Particle Systems. Springer, Berlin. Manski, C. (1993): “Identification of Endogenous Social Effects: The Reflection Problem,” Review of Economic Studies, 60, 531–542. Martimort, D., and L. Stole (2011): “Aggregate Representations of Aggregate Games with Applications to Common Agency,” working paper, Paris School of Economics, and University of Chicago. Mas-Colell, A., M. Whinston, and J. Green (1995): Microeconomic Theory. Oxford University Press. McLennan, A. (1997): “The Maximal Generic Number of Pure Nash Equilibria,” Journal of Economic Theory, 72, 408–410. Menzel, K. (2011): “Robust Decisions for Incomplete Models of Social Interactions,” working paper, New York University. Milgrom, P., and J. Roberts (1990): “Rationalizability, Learning, and Equilibrium in Games with Strategic Complementarities,” Econometrica, 58(6), 1255–1277. Molchanov, I. (2005): Theory of Random Sets. Springer, London. Nakajima, R. (2007): “Measuring Peer Effects on Youth Smoking Behaviour,” Review of Economic Studies, 74, 897–935. Pakes, A., J. Porter, K. Ho, and J. Ishii (2006): “Moment Inequalities and their Application,” working paper, Harvard University. GAMES WITH EXCHANGEABLE PLAYERS 45 Powell, J., J. Stock, and T. Stoker (1989): “Semiparametric Estimation of Index Coefficients,” Econometrica, 57(6), 1403–1430. Rockafellar, R., and R. Wets (1998): Variational Analysis. Springer, Heidelberg. Shang, Q., and L. Lee (2011): “Two-Step Estimation of Endogenous and Exogenous Group Effects,” Econometric Reviews, 30(2), 173–207. Tamer, E. (2003): “Incomplete Simultaneous Discrete Response Model with Multiple Equilibria,” Review of Economic Studies, 70, 147–165. Topa, G. (2001): “Social Interactions, Local Spillovers and Unemployment,” Review of Economic Studies, 68, 261–295. van der Vaart, A. (1998): Asymptotic Statistics. Cambridge University Press, Cambridge. van der Vaart, A., and J. Wellner (1996): Weak Convergence and Empirical Processes. Springer, New York. Weintraub, G., L. Benkard, and B. van Roy (2008): “Markov Perfect Industry Dynamics With Many Firms,” Econometrica, 78(6), 1375–1711.
© Copyright 2026 Paperzz