AXIOMATIC EQUILIBRIUM SELECTION FOR GENERIC TWO-PLAYER GAMES

2010

We apply three axioms adapted from decision theory to refinements of the Nash equilibria of games with perfect recall that select connected closed sub- sets called solutions. No player uses a weakly dominated strategy in an equilibrium in a solution. Each solution contains a quasi-perfect equilibrium and thus a sequential equilibrium in strategies that provide conditionally admissible optimal continuations from

AXIOMATIC EQUILIBRIUM SELECTION FOR GENERIC TWO-PLAYER GAMES SRIHARI GOVINDAN AND ROBERT WILSON Abstract. We apply three axioms adapted from decision theory to refinements of the Nash equilibria of games with perfect recall that select connected closed subsets called solutions. Undominated Strategies: No player uses a weakly dominated strategy in an equilibrium in a solution. Backward Induction: Each solution contains a quasi-perfect equilibrium and thus a sequential equilibrium in strategies that provide conditionally admissible optimal continuations from information sets. Small Worlds: A refinement is immune to embedding a game in a larger game with additional players provided the original players’ strategies and payoffs are preserved, i.e. solutions of a game are the same as those induced by the solutions of any larger game in which it is embedded. For games with two players and generic payoffs, we prove that these axioms characterize each solution as an essential component of equilibria in undominated strategies, and thus a stable set as defined by Mertens (1989). Date: 28 May 2009. Key words and phrases. game, equilibrium, refinement, axiom, admissibility, backward induction, small worlds, stability. JEL subject classification: C72. This work was funded in part by a grant from the National Science Foundation of the United States. 1 2 SRIHARI GOVINDAN AND ROBERT WILSON Contents List of Figures 1. Introduction 2. Notation 3. The Axioms 3.1. Undominated Strategies 3.2. Backward Induction 3.3. Small Worlds 3.4. Summary of the Axioms 4. Additional Notation and Properties 4.1. Payoffs 4.2. Notation for the Extensive Form 4.3. Enabling Strategies 4.4. Stable Sets of Equilibria 4.5. The Associated Pseudomanifold 5. Statement and Proof of the Theorem 5.1. Preliminaries 5.2. A Game with Redundant Strategies 5.3. Extensive Form of the Metagames 5.4. Outsiders’ Payoffs in the Metagames 5.5. Outsiders’ Strategies in a Quasi-Perfect Equilibrium 5.6. Limits of the Quasi-Perfect Equilibria of the Metagames 5.7. Insiders’ Strategies in a Quasi-Perfect Equilibrium 5.8. The Induced Lexicographic Probability System 5.9. Limit of the Lexicographic Probability System 5.10. Final Step of the Proof 6. Concluding Remarks Appendix A. Enabling Strategies Appendix B. Proofs of Propositions B.1. Proof of Proposition 3.3 B.2. Proof of Proposition 3.4 Appendix C. Proof of Theorem 4.1 Appendix D. The Genericity Assumption References 2 3 5 6 6 6 7 9 10 10 10 11 12 13 13 14 15 16 17 17 19 20 21 22 24 25 27 28 28 29 30 45 46 List of Figures 1 2 A game tree and its pure reduced normal form in which each pure strategy is identified by the terminal nodes it does not exclude. 27 The game tree of a signaling game and its pure reduced normal form in which each pure strategy is identified by the terminal nodes it does not exclude. 28 AXIOMATIC EQUILIBRIUM SELECTION 3 1. Introduction Nash’s [22, 23] definition of equilibrium is insufficient when applied to a game in extensive form. Even for games with perfect information, some equilibria use weakly dominated strategies and yield outcomes different from the outcome predicted by backward induction; see [11, §4.2] for an example. The literature on refinements aims to sharpen Nash’s definition to exclude such equilibria; recent surveys include [8, 12, 31]. Kohlberg and Mertens [14] suggest that a refinement should be characterized by axioms adapted from decision theory. They also specify properties that axioms should imply. Subsequently, Mertens [19, 20, 21] defines the set-valued refinement called stability and shows that it has these and other properties, including the following.1 1. Admissibility and Perfection. All equilibria in a stable set are perfect (hence admissible) so each player’s strategy in each equilibrium is undominated. 2. Backward Induction and Forward Induction. A stable set includes a proper equilibrium that induces a quasi-perfect (hence sequential) equilibrium in every extensiveform game with perfect recall that has the same normal form. A subset of a stable set survives iterative elimination of weakly dominated strategies and strategies that are inferior replies at every equilibrium in the set. 3. Invariance, Small Worlds and Decomposition. The stable sets of a game are the projections of the stable sets of any larger game in which it is embedded. The stable sets of the product of two independent games are the products of their stable sets. For games in extensive form with perfect recall, we propose three axioms that are versions of these properties. We assume that for each game a refinement selects a nonempty collection of nonempty connected closed subsets of equilibria, called solutions.2 The axioms are the following: A. Undominated Strategies: No player uses a weakly dominated strategy in any equilibrium in a solution. 1 Mertens considers the graph of equilibria over the space of perturbed games obtained by perturbing players’ strategies toward mixed strategies, and then defines a connected closed set of equilibria as stable if the projection from a neighborhood in this graph is an essential map, i.e. has nonzero degree. See Mertens [19] for further topological aspects of the definition, and Govindan and Mertens [3] for an equivalent definition in terms of players’ best-reply correspondences. 2 Solutions are assumed to be sets because Kohlberg and Mertens [14, pp. 1015, 1019, 1029] show that there need not exist a single equilibrium satisfying weaker properties than the axioms invoked here. The technical requirement that a solution is connected excludes the trivial refinement that selects all equilibria. If only a single (possibly unconnected) subset is selected then only the trivial refinement satisfies the conditions invoked by Norde, Potters, Reijnierse, and Vermeulen [24]. Kreps and Wilson [16, Theorem 2] show that if payoffs are generic then all equilibria in a connected set yield the same probability distribution over terminal nodes, and thus the same paths of equilibrium play in the extensive form. 4 SRIHARI GOVINDAN AND ROBERT WILSON B. Backward Induction: A solution contains a quasi-perfect equilibrium. S. Small Worlds: A solution is immune to embedding the game within larger games that preserve players’ strategies and payoffs. In Axiom A we invoke only the implication of admissibility that no player uses a weakly dominated strategy. Axiom A is equivalent to admissibility in two-player games. Axiom B insists on inclusion of a quasi-perfect equilibrium because it induces a sequential equilibrium in strategies that provide admissible optimal continuations from players’ information sets.3 Axiom S ensures that a refinement is not vulnerable to framing effects depending on how the game is presented within wider contexts called metagames. A metagame can include additional players and additional pure strategies, provided these additional features do not alter optimal decisions in the original game. Our version of small worlds is stronger than Mertens’ [21] version because our definition of embedding allows a more general class of metagames. We prove that these three axioms characterize refinements that select some or all of the stable sets of a game in extensive form with perfect recall, two players, and generic payoffs. Our characterization is cast in terms of the ‘enabling form’ of a game in which two pure strategies of a player are considered equivalent if they exclude the same terminal nodes of the game tree, and thus enable the same sets of terminal nodes to occur as outcomes of the game. The enabling form is defined in Section 4.3 and explained further in Appendix A. Our main theorem establishes that the axioms imply that each solution is an essential component of admissible equilibria. For the enabling form of a game this is the defining property of a stable set.4 When payoffs are generic, all equilibria in a component yield the same probability distribution over outcomes. Therefore, for economic modeling and econometric studies the axioms’ chief implication is that a predicted outcome distribution should result from an essential component of the game’s admissible equilibria. A secondary implication is that a solution must include all equilibria in the component. This is germane for predicting players’ behaviors only after deviations from equilibrium play, but it addresses the issue of whether sequential rationality is a relevant decision-theoretic criterion after deviations (Reny [26]). We show that each equilibrium in a solution is induced by a quasi-perfect equilibrium in a 3 We intend that a revised version will weaken Axiom B to only require inclusion of a sequential equilibrium for which each player’s strategy is conditionally admissible from each of his information sets. 4 A component is a maximal closed connected set, and it is essential if it has the property described in footnote 1. For the usual normal form of the game, a solution can be a subset of a component that maps to a component of the enabling form of the game. AXIOMATIC EQUILIBRIUM SELECTION 5 solution of some metagame; therefore, it is sequentially rational when viewed in the wider context of this metagame. See [7, Section 2.3] for an example. Section 2 establishes notation for Section 3, which specifies Axioms A (undominated strategies), B (backward induction), and S (small worlds), including a precise definition of embedding a game in a larger game. Appendix B verifies that Nash equilibria satisfy this definition of embedding. Section 4 establishes further notation and states a key technical property proved in Appendix C. Section 5 states and proves the main theorem, Theorem 5.1. The proof is constructive in that each equilibrium in a stable set is shown to be induced by a quasi-perfect equilibrium in a solution of a particular metagame with perfect recall that embeds the given game. Section 6 interprets this result and provides concluding remarks. 2. Notation This section provides sufficient notation for statements of the axioms in Section 3. Section 4 introduces additional notation for the theorems in Section 5 and Appendix C. A typical game in extensive form is denoted Γ. Its specification includes a set N of players, a game tree with perfect recall for each player, and an assignment of players’ payoffs at each terminal node of the tree. Let Hn be player n’s collection of information sets, and let An (h) be his set of feasible actions at information set h ∈ Hn . The specification of the tree can include a completely mixed strategy of Nature. A player’s pure strategy chooses an action at each of his information sets. Denote n’s simplex of mixed strategies by Σn and interpret its vertices as his set Sn of pure strategies. Q Q The sets of profiles of players’ pure and mixed strategies are S = n Sn and Σ = n Σn . The normal form of Γ assigns to each profile of players’ pure strategies the profile of their expected payoffs; equivalently, it is the multilinear function G : Σ → RN that assigns to each profile of their mixed strategies the profile of their expected payoffs. A player’s behavioral strategy specifies a mixture over his actions at each of his information Q sets. Let Bn be n’s set of behavioral strategies, and B = n Bn the set of profiles of players’ behavioral strategies. Each mixed strategy σn induces a behavioral strategy bn . Because the game has perfect recall, for each behavioral profile there are profiles of mixed strategies that induce it and yield the same distribution of outcomes (Kuhn [17]). As defined by Nash [22, 23], an equilibrium is a profile of players’ strategies such that each player’s strategy is an optimal reply to other players’ strategies. That is, if BRn (σ) ≡ arg maxσn′ ∈Σn Gn (σn′ , σ−n ) is player n’s best-reply correspondence, then σ ∈ Σ is an equilibrium iff σn ∈ BRn (σ) for every player n. The analogous definition of equilibrium in behavioral strategies is equivalent for games with perfect recall. 6 SRIHARI GOVINDAN AND ROBERT WILSON A refinement is a correspondence that assigns to each game a nonempty collection of nonempty connected closed subsets of its equilibria. Each selected subset is called a solution. According to the above definitions, a mixed or behavioral strategy makes choices even at information sets that its previous choices exclude from being reached. In Section 4 we consider pure strategies to be equivalent if they make the same choices at information sets they do not exclude. And, we further simplify mixed and behavioral strategies by considering only their induced probability distributions on non-excluded terminal nodes. The definitions of an equilibrium and a refinement have equivalent statements in terms of these strategy spaces. Each equilibrium in a reduced strategy space corresponds to a set of equilibria as defined above, and analogously for solutions selected by a refinement. The axioms in Section 3 are stated in terms of mixed and behavioral strategies as defined above. One reason for this is that Axiom S implies invariance to redundant strategies, and thus justifies our later use of equivalence classes of strategies. 3. The Axioms 3.1. Undominated Strategies. The first axiom requires that no player uses a weakly dominated strategy. Say that a profile of players’ strategies is undominated if each player’s strategy is undominated. Axiom A (Undominated Strategies): Each equilibrium in a solution is undominated. 3.2. Backward Induction. We interpret sequential equilibrium as the generalization to games with perfect recall of backward induction in games with perfect information, and to be consistent with Axiom A we insist on conditionally admissible optimal continuations from information sets. Here we obtain these properties from a quasi-perfect equilibrium, which requires a consistency property of beliefs that is stronger than required by Kreps and Wilson’s [16] definition of sequential equilibrium. Van Damme [30, p. 8] defines a quasiperfect equilibrium in terms of players’ behavioral strategies. Definition 3.1 (Quasi-Perfect Equilibrium). A profile b ∈ B of behavioral strategies is a quasi-perfect equilibrium if it is the limit of a sequence of profiles of completely mixed behavioral strategies for which, for each player n, from each of his information sets, continuation of his strategy bn is an optimal reply to every profile in the sequence. That is, if BRn (·|h) is n’s best-reply correspondence in terms of behavioral strategies that continue from his information set h ∈ Hn , and bn (h) is the continuation of his behavioral AXIOMATIC EQUILIBRIUM SELECTION 7 strategy bn from this information set, then the profile b ∈ B is quasi-perfect if (∀ k) (∀ n ∈ N, h ∈ Hn ) bn (h) ∈ BRn (b̂k |h) for some sequence b̂k ∈ B \ ∂B for which b̂k → b. If the mixed-strategy profile σ ∈ Σ induces a behavioral profile b ∈ B that is a quasi-perfect equilibrium then we say that σ too is quasi-perfect. Similarly, the justifying sequence b̂k can be represented by a sequence σ̂ k in Σ \ ∂Σ for which σ̂ k → σ. Van Damme shows that a quasi-perfect equilibrium induces a perfect equilibrium of the normal form, and a sequential equilibrium of the extensive form in which all players’ beliefs are induced by the same justifying sequence. Moreover, by construction a quasi-perfect equilibrium provides for each player an admissible optimal continuation from each of his information sets. It can be shown that if payoffs are generic then every sequential equilibrium is quasi-perfect and extensive-form perfect. The second axiom requires that some equilibrium in a solution is quasi-perfect. Axiom B (Backward Induction): Each solution contains a quasi-perfect equilibrium. If payoffs are generic then Axioms A and B imply that each solution lies in a component of undominated equilibria, each of which yields the same distribution over outcomes as a sequential equilibrium in the solution. 3.3. Small Worlds. The third axiom requires that a refinement is not affected by extraneous features of wider contexts in which a game is embedded, provided such contexts do not alter players’ feasible strategies and payoffs. An embedding allows the presence of additional players whose actions might provide the original players with additional pure strategies equivalent to mixed strategies in the original game, and thus redundant. For simplicity, to define an embedding we use the normal form G : Σ → RN of the extensive-form game Γ. An embedding is described by a ‘larger’ game G̃ : Σ̃ × Σo → RN ∪ o in which game G is ‘embedded,’ subject to certain restrictions specified below. The larger game G̃ has outsiders in a set o, in addition to insiders who are the players in N , and there can be additional moves by Nature. An insider n can have additional pure strategies in Σ̃n that are not pure strategies in Sn but are equivalent to mixed strategies in Σn . The basic requirement is that an embedding should preserve the game among insiders, conditional on actions by outsiders. These restrictions have a technical formulation. There should exist a multilinear map f : Σ̃ × Σo → Σ that is surjective and such that G̃n = Gn ◦ f for each insider n. Moreover, to exclude an embedding from introducing correlation among insiders’ strategies, f should factor into separate multilinear maps (fn )n∈N , where each component is a map fn : Σ̃n ×Σo → 8 SRIHARI GOVINDAN AND ROBERT WILSON Σn such that fn (·, σo ) maps Σ̃n surjectively onto Σn for each profile σo ∈ Σo of outsiders’ strategies. Admittedly, a statement of the axiom that uses this technical language could contain unsuspected implications. However, after stating the formal definition, we provide in Proposition 3.3 an equivalent formulation that is more detailed and more transparent, and that verifies the requisite properties. Also, Proposition 3.4 applies a precise test of whether the axiom is correctly stated, namely, a refinement that satisfies the axiom should be immune to the same embeddings that equilibria are. Definition 3.2 (Embedding). A game G̃ : Σ̃ × Σo → RN ∪ o and a collection of multilinear maps fn : Σ̃n × Σo → Σn , one for each player n ∈ N , embed a game G : Σ → RN if (a) for each σo ∈ Σo , fn (·, σo ) maps Σ̃n surjectively onto Σn , and (b) G̃n = Gn ◦ f , where f = (fn )n∈N . Condition (a) ensures that embedding has no net effect on an insider’s set of mixed strategies, conditional on outsiders’ strategies, and condition (b) ensures that there is no net effect on any insider’s payoffs. Hereafter, if G̃ embeds G via maps f = (fn ) then we say that (G̃, f ) embeds G and that G̃ is a metagame for G. We omit description of f for metagames in extensive form that embed a game in extensive or strategic form. An elaborate example of a metagame in extensive form that embeds a game in extensive form is constructed in proving Theorem 5.1. Note that the property that G̃ is a metagame for G does not depend on specifying payoffs for the outsiders; in particular, it depends only on their extensive forms and the insiders’ payoffs in the two games. A multilinear map fn : Σ̃n × Σo → Σn is completely specified by its values at profiles of pure strategies. Let fˆn be the restriction of fn to the set S̃n × So of profiles of pure strategies of player n and the outsiders in o. The following Proposition, proved in Appendix B, provides an alternative definition of embedding in terms of pure strategies. Proposition 3.3. G̃ embeds G via a collection of multilinear maps f = (fn )n∈N if and only if for each player n there exists T̃n ⊆ S̃n and a bijection πn : T̃n → Sn such that for each (s̃, so ) ∈ S̃ × So and t̃n ∈ T̃n : (1) fˆn (t̃n , so ) = πn (t̃n ), (2) G̃n (s̃, so ) = Gn (fˆ(s̃, so )), where fˆ = (fˆn )n∈N . AXIOMATIC EQUILIBRIUM SELECTION 9 Property (2) assures that players’ payoffs from pure strategies of G are preserved by the metagame G̃. Hence property (1) assures that each pure strategy sn ∈ Sn is equivalent to some pure strategy t̃n = π −1 (sn ) ∈ T̃n , independently of the outsiders’ profile so . Q Pure strategies in S̃n \ T̃n are redundant because payoffs from profiles in n T̃n exactly Q replicate payoffs from corresponding profiles in n Sn for the embedded game G. In particular, if fˆn (s̃n , so ) = σn ∈ / Sn then conditional on so the pure strategy s̃n is equivalent for insiders to the mixed strategy σn ∈ Σn . The next Proposition, proved in Appendix B, verifies that equilibria are not affected by embedding in a metagame. Proposition 3.4. If (G̃, f ) embeds G then the equilibria of G are the f -images of the equilibria of G̃. A corollary of Proposition 3.4 is that embedding does not introduce correlation among insiders’ strategies. Using Definition 3.2 of embedding, the small worlds axiom is the following. Axiom S (Small Worlds): If (G̃, f ) embeds G then the f -images of the solutions that a refinement selects for G̃ are the solutions selected for G. In view of Proposition 3.4, this axiom is an instance of the general principle that a refinement should inherit invariance properties of equilibria. Two special cases of Axiom S are the following. Invariance: Suppose Σo and o are singletons and insiders’ payoffs and strategies in Σ̃ differ from Σ only by treating some mixed strategies in Σ as additional pure strategies in S̃. Then Axiom S implies that solutions depend only on a game’s reduced normal form obtained by deleting such redundant pure strategies. Mertens’ Small Worlds Axiom [21]: Suppose Σ̃ = Σ. Then Axiom S implies that a solution does not depend on the presence of outsiders, i.e. solutions of the original game are the projections of the solutions of metagames obtained by adding dummy players. 3.4. Summary of the Axioms. We study refinements that are independent of embeddings in metagames that, for each profile of outsiders’ strategies, preserve the strategies and payoffs of the game among insiders. And, we require that each of their solutions is a closed connected set of undominated equilibria, including one that is quasi-perfect. In particular, a solution of a metagame must contain a quasi-perfect equilibrium whose image is in the corresponding solution of the embedded game. 10 SRIHARI GOVINDAN AND ROBERT WILSON 4. Additional Notation and Properties In the sequel we consider only a game Γ in extensive form with perfect recall, two players, and generic payoffs. In this section we prepare for the statement and proof of the main theorem in Section 5. 4.1. Payoffs. Let Z be the set of terminal nodes of the game tree. Players’ payoffs are given by a point u in U = RN ×Z , where un (z) is the payoff to player n ∈ N at terminal node z ∈ Z. We assume that payoffs are generic in that there exists a lower dimensional subset U◦ of U such that our results are true for all games in U \ U◦ . The set U◦ includes the nongeneric set described in [4]. Therefore, each game outside U◦ has finitely many equilibrium outcome distributions, and in particular all equilibria in a component yield the same distribution over outcomes. However, the proofs in Section 5 and Appendix C require some genericity properties that are not necessarily implied by the construction in [4]. To avoid disrupting the main exposition, we defer to Appendix D the description of the exact set of genericity properties required for the proofs, and an explanation of why the resulting set U◦ of excluded payoffs has lower dimension. 4.2. Notation for the Extensive Form. The set of players is N = {1, 2}, typically represented as a player n and the other player m 6= n. Let X be the set of nodes in the game tree. Let Xn be the set of nodes where player n moves, partitioned into his information sets h ∈ Hn . For a node x ∈ Xn we write h(x) for the unique information set h ∈ Hn that contains x. For each n and h ∈ Hn , let An (h) be the set of actions available to player n at h. Assume that actions at all information sets are labeled differently, and let An be the set of all actions for player n. Node x precedes another node y, written x ≺ y, if x is on the unique path from the root of the tree to y. For a node x ∈ Xn and a ∈ An (h(x)) write (x, a) ≺ y if x ≺ y and the path from the root of the tree to y requires player n to choose a at h(x). If (x, a) ≺ y and x and y belong to n’s information sets h and h′ , respectively, then every node in h′ follows some node in h by the choice of a, so we write (h, a) ≺ h′ . The set of pure strategies of player n is the set Sn of functions sn : Hn → An such that sn (h) ∈ An (h) for all h ∈ Hn . For each n, sn ∈ Sn and y ∈ X, let βn (y, sn ) be the probability that sn does not exclude y, i.e. βn (y, sn ) = 1 if for each (x, a) ≺ y with x ∈ Xn , sn (h(x)) = a, and otherwise βn (y, sn ) = 0. By perfect recall, if y ∈ Xn then βn (y ′ , sn ) = βn (y, sn ) for all y ′ ∈ h(y) and we write βn (h(y), sn ) for this probability. Likewise, for any node y we write β0 (y) for the probability that Nature does not exclude y. Then for a profile s ∈ S the probability that node y is reached is β(y, s) ≡ β0 (y)β1 (y, s1 )β2 (y, s2 ). AXIOMATIC EQUILIBRIUM SELECTION 11 Recall that Σn is the set of mixed strategies of player n. For each node y the function P βn (y, ·) extends to a function over Σn via βn (y, σn ) = sn ∈Sn βn (y, sn )σn (sn ) for σn ∈ Σn . Recall also that Bn is the set of behavioral strategies of player n. For each bn ∈ Bn , βn (y, bn ) is the product of bn ’s probabilities of n’s actions on the path to y. Similarly extend β to profiles of mixed or behavioral strategies. Given a mixed-strategy profile σ ∈ Σ, the probability that outcome z results is β(z, σ) = β0 (z)β1 (z, σ1 )β2 (z, σ2 ). 4.3. Enabling Strategies. For each player n define ρn : Σn → [0, 1]Z by the formula ρn (σn ) = (βn (z, σn ))z∈Z , and let ρ = (ρn )n∈N . Similarly, if bn ∈ Bn is the behavioral strategy Q induced by σn then ρn (σn ) = (βn (z, bn ))z∈Z . Let Pn be the image of ρn , and P = n Pn the image of ρ. Then Pn is a compact convex polyhedron, called the space of n’s enabling strategies in [5]. Each vertex of Pn corresponds to an equivalence class of n’s pure strategies that exclude the same outcomes.5 If σ ∈ Σ and p = ρ(σ) then the probability of outcome z is γz (p) = β0 (z)p1 (z)p2 (z). Thus the function γ : P → ∆(Z) summarizes the extensive form. The analog of the game Γ’s normal form G : Σ → RN is the enabling form G : P → RN that assigns to each profile P of enabling strategies the profile of players’ expected payoffs, where Gn (p) = z γz (p)un (z). Note that γ and G are multilinear functions. From players’ best-reply correspondences in terms of enabling strategies one obtains the definition of equilibrium in enabling strategies. To each equilibrium in enabling strategies there correspond families of outcome-equivalent equilibria in behavioral and mixed strategies. Axioms A and S have direct analogs in terms of enabling strategies, as shown in [11]. For games in extensive form with perfect recall, enabling strategies are minimal representations. For example, using perfect recall, by working backward in the induced tree of a player’s information sets, from his enabling strategy one can construct the corresponding behavioral strategy at his information sets that his prior actions do not exclude. Because Axiom S implies Invariance, it is immaterial whether solutions are characterized in terms of mixed, behavioral, or enabling strategies. We use enabling strategies here because induced distributions over outcomes are multilinear functions of enabling strategies, like they are for mixed strategies but unlike the nonlinear dependence on behavioral strategies. Also, the dimensions of the spaces of enabling and behavioral strategies are the same, which is important for the technical property established in Theorem 4.1 below. Using these features, Section 5 derives the implications of the axioms in terms of enabling strategies. 5In [5], enabling strategies are defined in terms of terminal actions rather than terminal nodes. The present definition is equivalent because a player’s probability of enabling a terminal action is the probability of enabling each terminal node that follows it. The vertices of Pn are n’s pure strategies in the ‘pure reduced normal form’ defined by Mailath, Samuelson, and Swinkels [18]. See Appendix A for illustrations. 12 SRIHARI GOVINDAN AND ROBERT WILSON 4.4. Stable Sets of Equilibria. Now let Σ̄∗ be a component of the equilibria of Γ in terms of mixed strategies, and let Σ̄∗n be the projection of Σ̄∗ in Σn . By genericity, all equilibria in Σ̄∗ induce the same distribution over outcomes. Therefore, for each node x, β(x, σ) is the same for all σ ∈ Σ̄∗ ; in particular, if x belongs to information set h ∈ Hn and h is on an equilibrium path then βn (h, σn ) is the same for every equilibrium strategy σn of player n in Σ̄∗ . We therefore denote these probabilities by βn∗ (x) and βn∗ (h). Let Hn∗ be the collection of information sets h ∈ Hn of player n such that βn∗ (h) > 0 and let A∗n be the set of actions at information sets in Hn∗ that are chosen with positive probability by the equilibria in B̄ ∗ , where B̄ ∗ is the set of profiles of behavioral strategies induced by equilibria in Σ̄∗ . Let Sn0 ⊂ Sn be the set of pure strategies s0n with the property that, at each information set h ∈ Hn∗ that s0n does not exclude, s0n prescribes an action in A∗n . Let Sn1 = Sn \ Sn0 , i.e. each pure strategy sn in Sn1 chooses a non-equilibrium action at some information set h ∈ Hn∗ that it does not exclude. For i = 0, 1, let Σin be the set of mixed strategies whose support is contained in Sni . Observe that the support of n’s strategy in every equilibrium in Σ̄∗ is contained in Sn0 and that every strategy in Sn0 is a best reply against every equilibrium in Σ̄∗ . Thus Σ̄∗n is contained in Σ0n and Σ̄∗ = Σ̄∗1 × Σ̄∗2 . If Sn1 is empty for each n, then each equilibrium in B̄ ∗ is completely mixed; by genericity, B̄ ∗ is a singleton and its equivalent mixed strategy is stable. The only interesting case, therefore, is one where Sn1 is nonempty for at least one of the players. In order to avoid dealing with different cases, we assume that Sn1 is nonempty for each n. (Along the way, we will indicate how the proof changes when Sn1 is empty for exactly one player.) Let Pn0 be the image of Σ0n under ρn . Let Zn1 ⊂ Z be the set of terminal nodes z such that (h, a) ≺ z for some h ∈ Hn∗ and a ∈ / A∗n . Let Zn0 = Zn \ Zn1 . Then Pn0 is the set of pn ∈ Pn such that pn (z) = 0 for all z ∈ Zn1 and thus Pn0 is a face of Pn . However, the image Pn1 of Σ1n under ρn need not be a face of Pn . For i = 0, 1, let P i = P1i × P2i and define P = P 0 × P 1 . Z1 For each pn ∈ Pn , let ψZn1 (pn ) be the projection of pn to R+n ; then ψZn1 (pn ) = 0 iff pn ∈ Pn0 . Fix a point p̄m in the interior of Pm and define ηn : Pn → R by η(pn ) = P 0 1 p0 (z)p̄m (z)pn (z), where p0 is Nature’s enabling strategy. Then η(pn ) = 0 iff pn ∈ Pn . z∈Zn 1 Choose ε > 0 such that ηn (pn ) > ε for all pn ∈ Pn1 . Let Hn be the hyperplane in RZn with normal (p0 (z)p̄m (z))z∈Zn1 and constant ε. Hn separates the origin from ψZn1 (Pn1 ). Let Π1n be the intersection of Hn with ψZn1 (Pn1 ). Then there is a function π̄n1 from Pn \ Pn0 to Π1n that maps each pn ∈ / Pn0 to the point ε(ηn (pn ))−1 ψZn1 (pn ). AXIOMATIC EQUILIBRIUM SELECTION 13 Now let Σ∗ be a component of the undominated equilibria of the game Γ that is contained in Σ̄∗ . Since Σ̄∗ = Σ̄∗1 × Σ̄∗2 , Σ∗ is expressible as a product Σ∗1 × Σ∗2 , where Σ∗n is a component of the intersection of Σ̄∗n with the set of undominated strategies. For each n, let Q∗n be the image of Σ∗n under ρn , and Q∗ the image of Σ∗ under ρ, i.e. represented in enabling strategies. Hereafter we say that Q∗ is stable if Σ∗ is a stable set of equilibria.6 4.5. The Associated Pseudomanifold. Given Q∗ , let Q be the set of (q ∗ , (p0 , p1 ), π 1 ) ∈ Q∗ × P × Π1 such that there exist r0 , p̃0n ∈ Pn0 , r1 ∈ Pn1 , and for each n scalars λ0n , λ1n , µ1n in the interval (0, 1] such that, if qn0 = λ0n p0n + (1 − λ0n )rn0 and qn1 = (1 − λ1n )p̃0n + λ1n (µ1n p1n + (1 − µ1n )rn1 ) , then for each n:7 (i) π̄n1 (qn1 ) = πn1 . ∗ 0 1 (ii) qn0 , and rn0 if λ0n < 1, are lexicographic best replies against (qm , qm , qm ). (iii) If µ1n < 1 then rn1 is a best reply against qn∗ and lexicographically as good a reply ∗ 0 1 against (qm , qm , qm ) as other strategies in Pn1 . Let ψ : Q → P be the natural projection, i.e. ψ(q ∗ , (p0 , p1 ), π 1 ) = (p0 , p1 ). Let ∂Q = ψ −1 (∂P). The following technical property is proved in Appendix C. Theorem 4.1. (Q, ∂Q) is a pseudomanifold of the same dimension as (P, ∂P). Moreover, the projection map ψ : (Q, ∂Q) → (P, ∂P) is essential iff Q∗ is stable. 1 In case Sn1 is empty (and Sm is not), then the set Π1n is empty. We set P = P 0 × Pm1 . 1 Points in Q are of the form (q 0 , p0 , πm ) and we drop the optimality requirement (iii) for n. 5. Statement and Proof of the Theorem We now state and prove the main theorem. We assume that a solution is represented in terms of enabling strategies, i.e. Q∗ ⊂ P is a solution iff it is the image under ρ of a solution Σ∗ ⊂ Σ. Theorem 5.1. Axioms A, B, S imply that each solution is an essential component of the game’s undominated equilibria. 6Alternatively, as noted by Mertens [21], one can apply the definition of stability directly to Q∗ as a component of equilibria, represented in terms of enabling strategies, over the space of perturbations of players’ enabling strategies. 7Lexicographically optimal replies are defined by Blume, Brandenburger, and Dekel [1] and Govindan and Klumpp [2]. 14 SRIHARI GOVINDAN AND ROBERT WILSON Proof. Let Q∗ ⊂ P be a solution. Let deg(ψ) be the degree of the projection map ψ : (Q, ∂P ) → (P, ∂P) defined in Section 4.5 above. Let q 0,∗ be an arbitrary point in Q∗ . For 0,∗ 0 is a best reply. Such a each m, choose a point p0,∗ m in the interior of Pm against which q choice is possible by genericity of payoffs: the interior of the projection of Σ̄∗m belongs to the interior of Pm0 and all strategies in Pn1 are inferior replies against every such point. Since qn0,∗ belongs to Q∗n , which consists only of undominated (hence admissible) strategies, there exists a point pm in the interior of Pm against which qn0,∗ is a best reply. pm is equivalent 0 to a completely mixed strategy σm in Σm . Express σm as a convex combination of σm and 1 i i 0 1,∗ σm , where for i = 0, 1, σm belongs to the interior of Σm . Let pm and pm be the enabling 0 1 strategies that are equivalent to σm and σm , respectively. Then p0m and p1,∗ m are in the relative 0 1 0 interiors of Pm and Pm respectively, and pm is a convex combination of pm and p1,∗ m . It follows ∗ 0,∗ 0,∗ 1,∗ 1,∗ 1,∗ 1 1 that x ≡ (q , (p , p ), π ) belongs to Q \ ∂Q, where for each n, πn = π̄n (pn ), and in 1 1 the definition of Q, qn0 is p0,∗ n and qn is pn . For any neighborhood U (q 0,∗ ) of q 0,∗ we can choose a neighborhood U (x∗ ) of x∗ that is homeomorphic to a simplex, is contained in Q \ ∂Q and such that the projection onto the first factor is contained in the neighborhood U (q 0,∗ ), i.e. if (q 0 , (p0 , p1 ), π 1 ) ∈ U (x∗ ) then q 0 ∈ U (q 0,∗ ). For the proof we construct a metagame such that: (i) if deg(ψ) is zero then there is no quasi-perfect equilibrium of the metagame projecting to an equilibrium in Q∗ ; and (ii) if deg(ψ) is nonzero then all its quasi-perfect equilibria that project to points in Q∗ , project to points in U (q 0,∗ ). Since U (q 0,∗ ) is an arbitrary neighborhood of the arbitrary point q 0,∗ in Q∗ , this proves the theorem. 5.1. Preliminaries. Fix some p̄ in the interior of P and define ι : ∂P → ∂P as follows: ι(p) is the unique point in the boundary of the form λp + (1 − λ)p̄ for λ < 0. Then ι has no fixed point. Let g∂Q : ∂Q → P be the function ι ◦ ψ∂Q , where ψ∂Q is the restriction of ψ to ∂Q. Then g∂Q has no point of coincidence with ψ, i.e., for each x ∈ ∂Q, g∂Q (x) 6= ψ(x). Let d be the dimension of P and let α be the degree of ψ. If α = 0 then, since ι is a homeomorphism, ∗ (1) = 0 in H δ (Q, ∂Q), where i : ∂Q → Q is the inclusion map, 1 is the generator δ ∗ i∗ g∂Q of H d−1 (P) ≈ Z, and δ ∗ is the coboundary operator. By the Hopf Extension Theorem [29, Corollary 8.1.18], g∂Q can be extended to a map g : Q → ∂P. Obviously g has no point of coincidence with ψ. If α 6= 0 then construct a map g∂U (x∗ ) from ∂U (x∗ ) to ∂P of degree ∗ δ ∗ ∗ ∗ α such that δ ∗ i∗ g∂Q∪∂U (x∗ ) (1) = 0 in H (Q \ (U (x ) \ ∂U (x )), ∂Q ∪ ∂U (x )), where now i : (∂Q ∪ ∂U (x∗ )) → Q \ (U (x∗ ) \ ∂U (x∗ )) is the inclusion map and δ ∗ is the coboundary operator. Using again the Hopf Extension Theorem, there exists a map g∂U (x∗ ) such that the two maps g∂Q and g∂U (x∗ ) can be extended to a map from Q \ (U (x∗ ) \ ∂U (x∗ )) to P; AXIOMATIC EQUILIBRIUM SELECTION 15 furthermore, by mapping points in U (x∗ ) to P in a way that extends g∂U (X ∗ ) , we obtain a map g : Q → P such that all its points of coincidence with ψ, of which there is at least one, are contained in U (x∗ ) \ ∂U (x∗ ). By construction there exists α > 0 such that for all x ∈ Q, kg(x)−ψ(x)k 6 α iff α 6= 0 and x ∈ U (x∗ ). Choose a triangulation Kni of Pni for each n and i = 0, 1 such that the diameter of each simplex is no more than α/2. For each n there exists for each i a triangulation Lin of Pni 1 and a triangulation LΠ n of Πn such that, letting L be the resulting multisimplicial subdivision of P 0 ×P×Π1 , g has a multisimplicial approximation g̃ [9, Appendix B] with the triangulation Q of the range given by K = n,i Kni . Observe that if for some x = (q 0 , p0 , p1 , π 1 ) there exists a multisimplex K that contains ψ(x) and g̃(x) then, since g̃ is a multisimplicial approximation of g, that multisimplex is a face of one that contains g(x) and thus kg(x) − ψ(x)k 6 α; in particular, if x also belongs to Q, then x ∈ U (x∗ ), and thus q 0 ∈ U (q 0,∗ ). As in [9, Theorem B.2], now take a further polyhedral subdivision T of L and let γ be the convex function generated by T , i.e. γ is piecewise linear and linear precisely on each full-dimensional polyhedron of the subdivision. Hereafter, unless explicitly stated, we represent players’ pure and mixed strategies in terms of the induced enabling strategies; thus, pure strategies in Sn are represented as vertices of Pn and mixed strategies are mixtures of these vertices. 5.2. A Game with Redundant Strategies. We first construct a game with redundant pure strategies that is the basis for the metagame specified later. For each fixed p̂ = (p0 , p1 ) ∈ P and δ ∈ (0, 1), consider the following game Γ(δ, p̂). Each player n chooses a strategy in the following manner (and unaware of his opponent’s choices). Initially, player n provisionally chooses a pure strategy s0n ∈ Sn0 , or he rejects all strategies in Sn0 . • If initially he chooses a strategy s0n then at a subsequent second stage he can retain s0n or revise his choice. At this second stage the revisions available are ‘duplicate’ pure strategies in a set Tn0 (δ, p0n ), each of which is a mixed strategy of the form tn (δ, p0n ) ≡ (1 − δ)tn + δp0n for some tn ∈ Sn0 . • If he rejects all strategies in Sn0 then at a second stage he can choose among the strategies in Tn0 (δ, p0n ) or reject them all.8 If he rejects them all then at a third stage he chooses among the pure strategies in Sn1 ∪ Tn1 (δ, p1n ), where each strategy in Tn1 (δ, p1n ) is a duplicate of the form tn (δ, p1n ) ≡ (1 − δ)tn + δp1n for some tn ∈ Sn0 . 8It would have sufficed, at this stage, to give player n the option of playing just the strategy p0n instead of all the strategies in Tn0 (δ, p0n ), which we do only for economy in notation. 16 SRIHARI GOVINDAN AND ROBERT WILSON In Γ(δ, p̂) the set of n’s pure strategies is S̃n (δ, p̂n ) ≡ Sn ∪ Tn0 (δ, p0n ) ∪ Tn1 (δ, p1n ). Thus, game Γ(δ, p̂) has the same reduced normal form as Γ. 5.3. Extensive Form of the Metagames. Now we specify a family of similar metagames Γ̃δ , one for each δ > 0. Before the insiders play, thirteen outsiders, denoted players o0 and oin,j for n = 1, 2, i = 0, 1 and j = 1, 2, 3, move simultaneously. Outsider o0 chooses a full-dimensional polyhedron T of the polyhedral complex T . Outsider oin,j , for n = 1, 2, j = 1, 3 and i = 0, 1, chooses a point in the vertex set Vni of Kni . Outsider o0n,2 chooses a point in a finite subset Sn0,δ of Pn0 chosen such that every point in Pn0 is within δ of some point in Sn0,δ ; outsider o1n,2 chooses a point in a finite subset Sn1,δ of Π1n such that every point in Π1n is within δ of some point in Sn1,δ . For outsider oin,1 , each pure strategy vni corresponds to a point in Pni denoted pin (vni ). i i Therefore, each mixed strategy σn,1 corresponds to a point in Pni , denoted pin (σn,1 ), which is obtained by taking the appropriate average of the points induced by the pure strategies in i 0 0 the support of σn,1 . Likewise a mixed strategy σn,2 of o0n,2 corresponds to a point qn0 (σn,2 ) in 1 1 ) in Π1n . of o1n,2 corresponds to a point πn1 (σn,2 Pn0 , and a mixed strategy σn,2 A mixed-strategy profile σ̃o for the outsiders induces a point (q 0 (σ̃o ), (p0 (σ̃o ), p1 (σ̃o )), π 1 (σ̃o )) in P 0 × P × Π1 , where for each n and i, pin (σ̃o ) depends on the choice by oin,1 , and qn0 (σ̃o ) and π̄n1 (σ̃o ) depend on the choices by o0n,2 and o1n,2 respectively. After each pure-strategy profile s̃o of the outsiders there follows a copy of the game Γ(δ, p̂(s̃o )). That is, if in the profile s̃o outsiders oin,1 choose points vni , then there follows a copy of Γ(δ, p̂) after these choices, where for each n, p̂n = (p0n (vn0 ), p1n (vn1 )). However, the information sets in Γ̃δ are such that the insiders play without knowing which copy of Γ(δ, p̂) they are playing. The sets of duplicate strategies available are therefore now denoted by Tn0 (δ) and Tn1 (δ), omitting the reference to p0n and p1n , since the insiders are uninformed about which mixtures were implemented by outsiders. Put differently, for i = 0, 1 and tn ∈ Sn0 , the exact duplicate strategy implemented by choosing tin (δ) ∈ Tni (δ) depends on the choice by the outsider oin,1 that insiders do not observe. Thus in the metagame Γ̃δ , player n’s set of pure strategies, up to duplication of pure strategies, is S̃n (δ) ≡ Sn ∪ Tn0 (δ) ∪ Tn1 (δ). That the metagame Γ̃δ embeds Γ follows from Proposition 3.3. The strategies in Sn are available as pure strategies in S̃n (δ) and the other pure strategies, which belong to Tni (δ), for i = 0, 1, implement mixtures in Σn that depend on the choices of outsiders oin,1 . AXIOMATIC EQUILIBRIUM SELECTION 17 5.4. Outsiders’ Payoffs in the Metagames. Next we describe the outsiders’ payoffs in each metagame Γ̃δ . The payoffs to o0 depend on the choices of all outsiders except outsiders oin,3 for i = 0, 1 and n = 1, 2. Recall that the convex function γ is linear over each full-dimensional polyhedron T of T . This linear function extends uniquely to a linear function γT over P 0 × P × Π1 . Every mixed strategy profile of the other outsiders induces a unique point (q̃, p̂, π 1 ) ∈ P 0 × P × Π1 and o0 ’s payoff from choosing T is γT (q̃, p̂, π 1 ). Outsider oin,1 wants to mimic oin,3 . In particular, if he chooses vni and oin,3 chooses wni then his payoff is one if vni = wni and zero otherwise. Outsider o0n,2 wants to mimic the actual choice implemented by player n when this choice belongs to Pn0 . Similarly, outsider o1n,2 wants to mimic the πn1 implied by n’s choice when he i plays a strategy in Pn1 . Specifically, for i = 0, 1, let ϕin : RZn → R be the function given by P i ϕin (r) = z∈Zni rz2 . For each r ∈ RZn , let ξni (r, ·) be the affine approximation to ϕin at r, i.e. P i for each r′ ∈ RZn , ξni (r, r′ ) = z (rz2 + 2rz (rz′ − rz )). Suppose now that oin,2 chooses a pure 1 0 strategy si,δ n,2 and n chooses a pure strategy s̃n in S̃n (δ). If s̃n is in Tn (δ) or Tn (δ) then let qn be the actual strategy in Pn that is implemented based on p0n (vn0 ) or p1n (vn1 ) where for each i, vni is the choice of outsider oin,1 ; and otherwise let qn = s̃n . For outsider o0n,2 , his payoff is 0 1 zero if qn ∈ / Pn0 ; otherwise, it is ξn0 (s0,δ n,2 , qn ). For outsider on,2 , his payoff is zero if qn ∈ Pn ; 1 otherwise it is ξn1 (s1,δ n,2 , π̄n (qn )). The payoff to outsider oin,3 depends on the choices of all other outsiders. If o0 chooses a polyhedron T then there exists a unique multisimplex L of L that contains T . For each vertex wni of Vni , and each vertex ṽ of L, let uin,2 (T, ṽ, wni ) = 1 if wni is the image of ṽ under g̃ni and zero otherwise, where g̃ni is the (n, i)-th coordinate map of g̃. The function uin,2 extends multilinearly to L and, since L is full-dimensional, to the whole of P 0 × P × Π1 , denoted still by uin,2 (T, ·, wni ). Given an arbitrary mixed strategy of the other players, if o0 chooses T and oin,2 chooses wni then the payoff of oin,2 is uin,2 (T, (p, q), wni ), where (p, q) is the point in P 0 × P × Π1 induced by the mixed strategies of the other players. 5.5. Outsiders’ Strategies in a Quasi-Perfect Equilibrium. If Q∗ is a solution of Γ then, by Axioms B and S, in the metagame Γ̃δ there exists a quasi-perfect equilibrium b̃δ whose equivalent mixed-strategy profile σ̃ δ belongs to a solution and whose image under the map from the metagame Γ̃δ to Γ is a point in Q∗ . Each player n’s strategy in b̃δ necessarily has the following feature. He avoids going to his information set where his choices are among the strategies in Sn1 (δ) ∪ Tn1 (δ), since each of 18 SRIHARI GOVINDAN AND ROBERT WILSON these strategies chooses a non-equilibrium action at some information set on the equilibrium path. Let qn0,δ be n’s actual strategy in Pn0 that is implemented by n’s strategy in the profile b̃δ in the metagame Γ̃δ . By construction, qn0,δ belongs to Q∗n . Let x̃δ ≡ (q̃ 0,δ , p0,δ , π̃ 1,δ ) be the point in P 0 × P × Π1 that is induced by the profile σ̃oδ of the outsiders’ strategies in the equilibrium σ̃ δ . Under b̃δ , after n has rejected all strategies in Sn0 and Tn0 (δ), consider the strategy implemented by n. Let (1 − αn1,δ ) be the total probability of choosing a duplicate in Tn1 (δ) under b̃δ . Then n’s choice at this information set is equivalent to an enabling strategy in Pn of the form 1,δ 1,δ 1,δ q̌nδ ≡ (1 − αn1,δ )((1 − δ)p̌0,δ n + δpn ) + αn rn , where: (i) p̌0n (δ) is the mixture over strategies tn such that the strategy t1n (δ) is played i i with positive probability at this information set; (ii) p1,δ n = pn (σ̃n,1 ) is the enabling strategy i of outsider oin,1 ; (iii) rn1,δ is the enabling strategy in induced by the equilibrium strategy σ̃n,1 Pn1 that is obtained from n’s actual mixture over strategies in Sn1 if αn1,δ > 0, and is arbitrary otherwise. Let −1 1,δ 1,δ q 1,δ = (δ(1 − αn1,δ ) + αn1,δ ) (δ(1 − αn1,δ )p1,δ n + αn rn ) . Then π̄n1 (q̌nδ ) = π̄n1 (qn1,δ ) ≡ πn1,δ . The following lemma characterizes the important aspects of the outsiders’ equilibrium strategies. Lemma 5.2. The equilibrium strategies of the outsiders satisfy the following properties. (1) For each n, suppose the vertices in the support Vni,δ of oin,3 ’s equilibrium strategy span a simplex Kni,δ of Kni . Then pni,δ belongs to Kni,δ . (2) If every polyhedron in the support of o0 ’s strategy contains x̃δ then, for each n and i, the vertices in Vni,δ span a simplex Kni,δ , and g̃n,i (x̃δ ) belongs to the interior of a simplex K̄ni,δ that has Kni,δ as a face. (3) Every polyhedron in the support of o0 ’s strategy contains x̃δ . (4) q̃n0,δ is within δ of qn0,δ and π̃n1,δ is within δ of πn1,δ . Proof of Lemma. Outsider oin,1 wants to mimic outsider oin,3 . So, if the vertices of Vni,δ span a simplex Kni,δ then the payoff to oin,1 from choosing a vertex wni is positive if it belongs to Vni,δ, and zero otherwise. Point (1) follows. Let L̄ = L0 × (L̃0 × L1 ) × LΠ be the unique multisimplex of L that contains x̃δ in its interior. For each polyhedron T in the support of o0 ’s strategy, there exists a full-dimensional multisimplex L̂ of L that contains T . Obviously L̂ has L̄ as a face. oin,3 ’s payoff from choosing AXIOMATIC EQUILIBRIUM SELECTION 19 a strategy wni if o0 chooses such a T , and given the strategies of the other outsiders, is positive if it is the image of a vertex of L̄ under g̃ni and zero otherwise. Since the image of the vertices of L̄ under the coordinate function g̃ni span a simplex K̄ni,δ , g̃ni (x̃δ ) ∈ K̄ni,δ and the vertices of δ span a face of K̄ni,δ . Therefore, point (2) follows. Vn,i For each polyhedron T of T , o0 ’s payoff from T is γT (x̃δ ) and by construction, γT (x̃δ ) 6 γ(x̃δ ) with the inequality being strict iff x̃δ does not belong to T , which proves (3). It remains to prove (4). The actual strategy implemented by n is qn0,δ , which belongs to Pn0 . o0n,2 ’s payoff function is such that his best replies to qn0,δ are the points in Sn0,δ that are closest to qn0,δ . Thus q̃n0,δ is within δ of qn0,δ . Since qn0,δ belongs to Pn0 , all of o1n,2 ’s strategies yield a payoff of zero against qn0,δ . However, since the behavioral strategy b̃δ is a quasi-perfect equilibrium, there exists a sequence of completely mixed behavioral strategies b̃ε,δ converging δ is a best reply. Under the sequence b̃ε,δ , to b̃δ against which o1n,2 ’s equilibrium strategy σ̃n,2 there is a positive probability that player n moves into the third stage of his choice and δ makes a choice among strategies in Sn1 ∪ Tn1 (δ). The fact that σ̃n,2 is optimal against the sequence implies that it is optimal against the limiting choice q̌nδ there. Since π̄n1 (q̌nδ ) = πn1,δ , o1n,2 ’s best replies are within δ of πn1,δ and thus πn1,δ is within δ of πn1,δ as well. ¤ 5.6. Limits of the Quasi-Perfect Equilibria of the Metagames. Consider a sequence of δ’s converging to zero and a corresponding sequence b̃δ of quasi-perfect equilibria in solutions of the metagames Γ̃δ . Let σ̃ δ be an equivalent sequence of mixed strategies and let (q̃ 0,δ , (p0,δ , p1,δ ), π̃ 1,δ ) be the sequence in Q∗ × P × Π1 induced by the outsiders’ strategies. Let q̃ 0,0 , (p0,0 , p1,0 ), and π̃ 1,0 be the corresponding limits of q̃ 0,δ , (p0,δ , p1,δ ), and π̃ 1,δ . Let q 0,0 and π 1,0 be the limits of q 0,δ and π 1,δ . By properties (1)-(3) of the previous lemma, for each δ, n, i, there exists a simplex K̄ni,δ of Kni that contains both g̃n,i (q̃ 0,δ , (p0,δ , p1,δ ), π̃ 1,δ ) and pi,δ n , with the former belonging to its interior. By property (4) of the previous lemma, q̃ 0,0 = q 0,0 and π̃ 1,0 = π 1,0 . By passing to a subsequence, we can assume that there exist multisimplices L̄ of L and K̄ of K such that for all δ, (q̃ 0,δ , (p0,δ , p1,δ ), π̃ 1,δ ) belongs to the interior of L̄ and its image under g̃ belongs to the interior of K̄—hence K̄ also contains (p0,δ , p1,δ ). Going to the limit, (q 0,0 , (p0,0 , p1,0 ), π 1,0 ) belongs to L̄ and its image under g̃ belongs to K̄; also (p0,0 , p1,0 ) belongs to K̄. Since L̄ and K̄ are multisimplices and g̃ is a multisimplicial map, we have that for all δ, xδ ≡ (q 0,0 , (p0,δ , p1,0 ), π 1,0 ) belongs to L̄, and g̃(xδ ), (p0,δ , p1,0 ) ∈ K̄. If we show that for small δ, xδ belongs to Q, then by construction this implies that the degree of ψ is nonzero and xδ ∈ U (x∗ ), and therefore q 0,0 ∈ U (q 0,∗ ), which proves the theorem. The remainder of the proof establishes this fact. 20 SRIHARI GOVINDAN AND ROBERT WILSON 5.7. Insiders’ Strategies in a Quasi-Perfect Equilibrium. Let b̃ε,δ be a sequence of completely mixed behavioral strategies converging to b̃δ against which for each insider n and each information set of n in Γ̃δ , his continuation strategy is optimal. We now study the important features of both the sequence and its limit. If n chooses sn ∈ Sn0 in the first stage then in the second stage he has the option of revising this strategy to play something in Tn0 (δ). Therefore, quasi-perfection implies that player n will end up implementing sn with positive probability in b̃δn only if this strategy is at least as good a reply against the sequence b̃ε,δ as the strategies in Tn0 (δ). Likewise, player n has the option of playing a strategy in Tn0 (δ) before he decides to play a strategy in Sn1 and even when he makes a choice among these strategies, he has the option of choosing a strategy in Tn1 (δ). Therefore, at the information set that follows his choice of avoiding strategies in Sn0 , b̃δn assigns a positive probability to moving on to a third stage and then choosing a strategy in Tn1 (δ) ∪ Sn1 only if one of these strategies is at least as good a reply against the sequence b̃ε,δ as all the strategies in Tn0 (δ). Furthermore, at the information set obtained after n avoids strategies in Sn0 ∪ Tn0 (δ), b̃δn assigns a positive probability to a strategy sn in Sn1 only if sn is at least as good a reply against the sequence b̃ε,δ as all the strategies in Sn1 ∪ Tn1 (δ). Let σ̃ ε,δ be a sequence of mixed strategy profiles that is equivalent to the sequence b̃ε,δ of behavioral strategy profiles. For each player n, his strategy σ̃nε,δ in the sequence is a mixture over his pure strategy set S̃n = Sn ∪ Tn0 (δ) ∪ Tn1 (δ). However, the implications of n’s strategy (for m’s choices) depend on the choices of the outsiders through their implications for strategies in Tn0 (δ) and Tn1 (δ). Each strategy tin (δ) plays tn with probability (1 − δ) and with probability δ plays a strategy in Pni that is determined by oin,1 ’s strategy. In order to fully capture the impact that oin,1 has on tin (δ), let Wni be the vertex set of Kni . Let T̄ni (δ) be the union over all wni ∈ Wni of the sets Tni (δ, pin (wni )). Let S̄n = Sn ∪ T̄n0 (δ) ∪ T̄n1 (δ) and let Σ̄n be the set of mixtures over S̄n . Since Wni is the pure strategy set of outsider oin,1 , and σ̃ ε,δ is a sequence of completely mixed has Wni as its support for all ε in the sequence. The sequence σ̃ ε,δ induces strategies, σ̃oε,δ i n,1 a mixed strategy σ̄nε,δ in S̄n for each n as follows. For each sn ∈ Sn , the probability σ̄nε,δ (sn ) of sn is σ̃nε,δ (sn ). For each i = 0, 1, wni ∈ Wni and tn ∈ Sn0 , the probability σ̄nε,δ (tin (δ, pin (wni ))) i is σ̃nε,δ (tin (δ))σ̃oε,δ From player m’s perspective it is the sequence σ̄nε,δ , or rather its i (wn ). n,1 equivalent sequence in Pn , that matters for his choice. AXIOMATIC EQUILIBRIUM SELECTION 21 5.8. The Induced Lexicographic Probability System. By Blume, Brandenburger, and Dekel [1, Appendix Proposition 2], we can construct for each player n a lexicographic probl (δ),δ ability system (LPS) Λ̄δn = (σ̄n0,δ , . . . , σ̄nn a subsequence converging to zero, ) over his strategies in S̄n such that for each ε in σ̄nε,δ = (1 − ν0 (ε))(σ̄n0,δ + ν0 (e)((1 − ν1 (ε))σ̄n1,δ + ν1 (ε)((1 − ν2 (ε))σ̄n1,δ + · · · + νln (δ)−1 (ε)σ̄nln (δ),δ )) , l (δ) where (ν0 (ε), . . . , νln (δ)−1 (ε)) is a sequence in R+n converging to the origin. Quasi-perfection implies that σ̄n0,δ is equivalent to the enabling strategy qn0,δ in Γ and it is a lexicographic best reply to Λ̄δm . Let ln0,δ be the first level in Λ̄δn at which some strategy in T̄n0 (δ) appears with positive probability. Let ln1,δ be the first level of Λ̄δn at which some strategy in Sn1 ∪ T̄n1 (δ) appears with positive probability. From the LPS Λ̄δn construct an LPS Λδ = (qn0,δ , . . . , qnln ,δ ) for Γ where for each l, qnl,δ is an enabling strategy in Γ that is equivalent to σ̄nl,δ . qn0,δ is a lexicographic best reply against Λδm . i,δ For each wni ∈ Wni , the total probability under σ̄nln of the strategies in Tni (δ, pin (wni )) is the i,δ i i δ limit σ̃oδi (wni ) of the sequence σoε,δ i (wn ). Since pn (σ̃oi ) is, by definition, pn , we have that n,1 level lni,δ n,1 is expressible as a convex combination n,1 li ,δ λnn pi,δ n li ,δ li ,δ li ,δ + (1 − λnn )rnn with λnn > 0 and li ,δ l1 ,δ rnn ∈ Pn . Since ln1,δ is the first level of Λδ that does not have its support in Pn0 , π̄n1 (qnn ) equals πn1,δ , which recall was computed from n’s strategy under b̃δ in the third stage of his choice. 0,δ We claim that for each l 6 ln0,δ , qnl,δ induces the equilibrium outcome against qm . Indeed, 0,δ ∗ 0 since qm belongs to Q , and the strategies in Tn (δ) always implement some strategy in Pn0 0,δ regardless of the choice of outsider o0n,1 , all the strategies in Tn0 (δ) are optimal against qm . By quasi-perfection, this implies that every strategy in S̄n that appears at a level l 6 ln0,δ of 0,δ 0,δ 0,δ Λ̄δn must be a best reply to qm . Thus for all l 6 ln0,δ , qnl,δ is a best reply to qm . Since qm is δ 0,δ a lexicographic best reply to Λn , (q̃n (ε), qm ) is an equilibrium of the game Γ for all small ε, where 0,δ 0,δ q̃n (ε) = (1 − ε) qn0,δ + ε((1 − ε)qn1,δ + ε2 ((1 − ε)qn2,δ + ε3 (· · · + εln qnln ) . By genericity of payoffs, Γ has finitely many equilibrium outcomes, so each of these equilibria induces the same equilibrium outcome—hence the claim follows. Three implications of this l1 ,δ claim are: (i) ln1,δ > ln0,δ ; (ii) the enabling strategy rnn in the previous paragraph belongs to Pn0 ; (iii) all levels up to ln0,δ prescribe the same mixture at each information set on the equilibrium path and differ only at information sets excluded by (all of) m’s equilibrium strategies in Q̄∗m , which is the ρm -image of the component Σ̄∗m of equilibria. 22 SRIHARI GOVINDAN AND ROBERT WILSON 5.9. Limit of the Lexicographic Probability System. Take a subsequence of the δ’s such that the following properties hold for the associated LPSs Λδn for each n: (i) lni,δ is independent of δ for each i, call it lni ; (ii) for each l 6 ln1 , the face of Pn that contains qnl,δ in its interior, as well as the strategies for m that are best replies to qnl,δ , are independent of δ. l1 ,δ l1 ,δ Let σnn be a sequence of strategies in Σn that is equivalent to the sequence qnn . Again using Blume, Brandenburger, and Dekel [1, Appendix Proposition 2], there is now an LPS l1 ,0 l1 ,kn (σnn , . . . , σnn ) and a sequence (µ0 (δ), . . . , µkn −1 (δ)) ∈ Rkn converging to zero such that for l1 ,δ a subsequence of δ’s, σnn is expressible as the nested combination 1 1 1 1 1 σnln ,δ = (1 − µ0 (δ))(σnln ,0 + µ0 (δ)((1 − µ1 (δ))σnln ,1 + µ1 (δ)((1 − µ2 (δ))σnln ,2 + · · · + µkn −1 (δ)σnln ,kn )) l1 ,kn l1 ,0 This LPS induces an equivalent LPS (qnn , . . . , qnn Let kn1 be the first level k of this LPS where l1 ,δ previous section that qnn l1 ,k qnn ) in enabling strategies. does not belong to Pn0 . Recall from the l1 ,δ l1 ,δ l1 ,δ n n is expressible as a convex combination λnn p1,δ n + (1 − λn )rn l1 ,δ l1 ,δ and that π̄n1 (qnn ) = πn1,δ . Express rnn as a convex combination αn0,δ rn0,δ + αn1,δ rn1,δ , where for l1 ,δ l1 ,k l1 ,d i = 0, 1, rni,δ ∈ Pni . Then λnn + (1 − λnn )αn1,δ > 0 since qnn does not belong to Pn0 . Going to −1 l1 ,δ l1 ,δ l1 ,δ a subsequence, let λ1n be the limit of (λnn + (1 − λnn )αn1,δ ) λnn and let rn1,0 be the limit of l1 ,k1 1,0 n n is expressible as a convex combination rn1,δ . Since the limit of p1,δ n is pn , we have that qn 1 1,0 1 1,0 0 ζn (λn pn + (1 − λn )rn ) + (1 − ζn )p̌n for some ζn > 0 and p̌0n ∈ Pn0 . Moreover, since πn1,0 is 1 l1 ,kn the limit of πn1,δ , π̄n1 (qnn ) = πn1,0 . 1 +1,δ l1 +kn For each δ in the subsequence used above, define now an LPS Λ̂δn = (q̂n0,δ , . . . , q̂nn 1 ,l−l1 −1 ln n follows: q̂n0,δ = qn0,0 , q̂nl,δ = qnl−1,δ if 0 < l 6 ln1 , and q̂nl,δ = qn ) as otherwise. The strategy q̂nl,δ l0 +1,δ is independent of δ for l = 0 and l > ln1 . Each level l < ln1 + kn1 + 1 is a strategy in Pn0 . q̂nn l0 ,δ l0 ,δ l0 ,δ n n is a convex combination λnn p0,δ n + (1 − λn )rn 1 +1,δ l1 +kn while q̂nn 1 +k 1 ,δ ln n 1 1,0 0 1 ζn (λ1n p1,0 n + (1 − λn )rn ) + (1 − ζn )p̌n where ζn > 0 and π̄n (q̂n is a convex combination ) = πn1,0 . The next lemma sets out the key properties of Λ̂δn that lead to a conclusion of our proof. Lemma 5.3. For all small δ, the LPS Λ̂δn satisfies the following properties. (1) A strategy is at least as good a reply against Λδn as another only if it is at least as good a reply against Λ̂δn . 0,0 and is at least as good a reply (2) If λ1n < 1, then the strategy rn1,0 is a best reply to qm lexicographically against Λδm as every strategy in Sn1 . 0,0 . (3) λ1n > 0 and every level l < ln1 + kn1 + 1 induces the equilibrium outcome against qm AXIOMATIC EQUILIBRIUM SELECTION l0 ,δ 23 l0 ,δ (4) The strategy q̂nl,δ for l < ln0 and the strategy rnn if λnn < 1 are lexicographic best δ replies against q̂m . Proof of Lemma. Suppose sm is a better reply against Λ̂δn than another strategy tm . We show that sm is also a better reply against Λδn . Let l be the first level of Λ̂δn such that sm is l,δ than tm . If l = 0 then for all small δ, sm is a better reply against a better reply against q̂m 0,δ 0,δ qn since q̄n equals the limit qn0,0 of qn0,δ ; thus against Λδn , tm is a worse reply against the very first level. If 0 < l < ln1 + 1 then obviously sm is better reply against Λδn than tm since level l of Λ̄δn corresponds to level l − 1 of Λδn . Suppose then that ln1 + 1 6 l 6 ln1 + kn1 + 1. l1 ,δ l1 ,δ Then for all small δ, sm is a better reply against qnn since qnn is a nested combination of l1 ,0 l1 ,kn (qnn , . . . , qnn ). Thus, sm is a better reply against Λδn . This proves (1). In the game Γ̃δ player n, when finally making a choice among the strategies in Sn1 ∪ Tn1 (δ), would choose a strategy sn in Sn1 with positive probability only if it is at least as good a reply against Λδn as the other strategies in Sn1 . Therefore, such a strategy would show up with positive probability under level ln1 of Λ̃δn (and hence in Λ̄δn ) only if this is the case. This l1 ,δ implies that if (1 − µnn )αn1 is positive, then the strategy rn1,δ is at least as good a reply against Λδn as the strategies in Sn1 . Recall that rn1,0 is the limit of rn1,δ and λ1n is the limit l1 ,δ 1 −1 l1 ,δ of (µnn + (1 − µln ,δ )αn1,δ ) µnn . Therefore, by point (1) of this lemma, rn1,0 is at least as good a reply against Λ̂δn as strategies in Sn1 if λ1n < 1. To show that it is also a best reply 0,0 against qm , suppose to the contrary that it is not. Then every strategy in Tn1 (δ), regardless 0,δ of outsider o1n,1 ’s choice, is a better reply against qm for all small δ, since for δ = 0 these 0,0 strategies are in Pn0 , which are best replies to qm . Therefore, quasi-perfection implies that 1 n would prefer to play the strategies in Tn (δ) rather than implementing rn1,0 (or rn1,δ when δ 1 is small), which shows that (1 − µln ,δ )αnl,δ = 0 for all small δ and hence that λ1n = 1. This proves (2). We turn now to (3). Every strategy q̂nl,δ for l < ln1 +kn1 +1 belongs to Pn0 and is thus optimal 0,0 against qm , which belongs to Q∗m . The strategy p̌0n (which recall is part of the expression l1 ,k1 defining qnn n ) which is chosen with positive probability is also in Pn0 and hence optimal 0,0 against qm . As we saw in the previous paragraph, if λ1n < 1, the strategy rn1 must also be 0,0 0,0 0,δ optimal against qm . Obviously qm is optimal against Λ̂δn since it is the limit of qm , which 0,0 , qn (ε)) is an equilibrium by point (1) is optimal against Λ̂δn . Therefore, for all small ε, (qm 24 SRIHARI GOVINDAN AND ROBERT WILSON of Γ, where q̃n (ε) = 1 µ ln1X +kn l ε + (1 − 1λ1n >0 )ε 1 +k 1 +1 ln n l=0 1 ¶−1 µ ln1X +kn εl q̂nl,δ + (1 − 1 1 1 1 1λ1n >0 )εln +kn +1 q̂nln +kn +1 l=0 1 +1 l1 +kn where 1λ1n >0 is the indicator function. This is impossible if λ1n = 0, since q̂nn Pn0 Pn1 . ¶ is a λ1n convex combination of a strategy in and one in Thus > 0. Moreover, since this is a continuum of equilibria, genericity implies that all of them induce the same outcome. Therefore, all strategies at levels preceding ln1 +kn1 +1 induce the equilibrium outcome against 0,0 qm . Lastly we prove (4). By the previous paragraph, all strategies in Pn0 are optimal against 1 1 k,δ q̂m for k 6 lm + km . Thus the optimality of a strategy in Pn0 depends on how it fares against 1 +1 l1 +km q̂mm . If a strategy sn ∈ Sn0 is not optimal against this strategy, then for all small δ, 1 l1 ,km some strategy in Tn0 (δ) is a superior reply to q̂mm regardless of what outsider o0n,1 ’s choice is. Thus, such a duplicate strategy is a better reply against Λδm than sn . Therefore, in the quasi-perfect equilibrium of Γδ , player n when given a choice of actually implementing sn will prefer this duplicate strategy in Tn0 (δ), i.e. the probability of sn is zero for all l 6 ln0 in Λδn . Point (4) now follows. ¤ 5.10. Final Step of the Proof. Fix a small δ that satisfies the properties enumerated in Lemma 5.3 of the previous subsection. Let µ X ¶−1 µ X ¶ 0 l l l,δ q̄n (ε) = ε ε q̂n 0 +1 l6ln 0 +1 l6ln 1 1 µ ln1 +k ¶−1 µ ln1 +k ¶ n +1 n +1 X X 1 l l l,δ q̄n (ε) = ε ε q̂n . 0 +2 l=ln 0 +2 l=ln Observe that q̄n0 (ε) belongs to Pn0 for all ε and it is a convex combination of p0,δ n and a subset Rn0 of strategies that are at least as good replies against Λ̂δm as other strategies in Sn0 . Likewise q̄n1 (ε) is a convex combination of p1n , rn1 and a point in Pn0 such that the strategy rn1 if it has a positive weight is at least as good a reply against Λ̂δm as other strategies in Pn1 . There exists a small ε such that a strategy sm is at least as good as a strategy tm against Λ̂δn iff it is lexicographically at least as good a reply against the LPS (q 0 , q̄n0 (ε), q̄n1 (ε)). Since π̄n1 (q̄n1 (ε)) = π̄n1 (qn1,0 ) = π 1,0 for all ε, it follows that (q 0 , (p0,δ , p1,0 ), π 1,0 ) belongs to Q. As we noted earlier, proving that this point belongs to Q shows that in fact it belongs to U (x∗ ) and hence that q 0 in U (q 0,∗ ), which completes the proof. ¤ AXIOMATIC EQUILIBRIUM SELECTION 25 Remark 5.4. In the case where Sn1 is empty for exactly one player n, as we said initially in the description of P and Q, we do not have the factor Pn1 or Π1n . In the family of games Γ(δ, p̂), player n decides provisionally in the first stage on the strategy in Sn to play and in the second stage gets to execute it or switch to playing a strategy in Tn (δ, p̂). In the metagame, we do not have outsiders o1n,j for j = 1, 2, 3. The rest of the analysis is essentially the same modulo these provisions. 6. Concluding Remarks Like our previous paper [10] on forward induction, the characterization in this paper is a step toward a theory of equilibrium refinement using axioms adapted from decision theory. Theorem 5.1 is confined to games in extensive form with perfect recall, two players, and generic payoffs, but it suggests that an extension to more general games might be possible. We hope also that Axiom B can be weakened to require only that a solution contains a sequential equilibrium for which each player’s strategy is conditionally admissible from each of his information sets—thus keeping the axioms entirely within standard decision theory. Previously, some proposed refinements selected equilibria with one or more desirable properties, like admissibility, subgame perfection, or sequential rationality. Other proposed refinements derived some properties from limits of equilibria of perturbed games, such as perfect, quasi-perfect, and proper equilibria. In the latter approach, a key step forward was Kohlberg and Mertens’ [14] argument that an axiomatic development requires set-valued refinements. Their program achieved remarkable success with Mertens’ [19] definition of a stable set, which has the desirable properties 1, 2, 3 listed in Section 1 and others too, such as ordinality and immunity to splitting players into agents. However, an axiomatic theory of refinement should be based on basic principles of rational behavior in the game at hand, as in decision theory. This precludes reliance on perturbed games obtained by perturbing players’ sets of feasible strategies. The challenge, therefore, has been to establish why consideration of perturbed games yields the requisite decisiontheoretic properties. Our answer here begins with Axiom S, which generalizes the invariance criterion of Kohlberg and Mertens’ [14] and the small worlds criterion of Mertens’ [21], as explained in Section 3.3. Absent a strong invariance property like Axiom S, a refinement is vulnerable to ‘framing effects’ depending on wider contexts in which the given game might be embedded. In decision theory, such effects were examined by Savage [28], and in cognitive psychology they play a prominent role in interpreting decisions by subjects in experiments, as for instance in Kahneman and Tversky [13]. For a theory of thoroughly rational behavior, however, an 26 SRIHARI GOVINDAN AND ROBERT WILSON axiom should exclude framing effects. Axiom S does this by requiring a solution of a game to be consistent with the solution of any metagame in which it is embedded. As shown in Proposition 3.4, it is already true of any equilibrium that it is consistent with an equilibrium of any metagame in which the game is embedded. Axiom S merely extends to refinements this fundamental invariance property of equilibria. Our answer continues with the proof of Theorem 4.1 in Appendix B. There it is shown that a set Q∗ of equilibria in enabling strategies is stable iff the corresponding projection map from the pseudomanifold Q to the space P of enabling strategies is essential. Using this key property, the proof of Theorem 5.1 shows that for each equilibrium in a component of undominated equilibria there exists a corresponding metagame for which the equilibrium is the image of a quasi-perfect equilibrium in the metagame if and only if the projection map is essential. Hence Axioms A, B, S imply that a solution is a stable set, and conversely due to Mertens’ previous proofs. The answer to the ‘why’ question above is thus that, given Axioms A and B, stability with respect to perturbed games is equivalent to an analogous ‘stability’ with respect to embeddings in metagames, as required by Axiom S. Because in practice every game is embedded in some wider context, we view Axiom S’s requirement that a refinement is immune to presentation effects as the relevant criterion from the perspective of decision theory. This view is reinforced by the facts that Nash equilibria satisfy Axiom S, and that together with Axioms A and B, the implied refinement agrees with stability based on perturbed games. For a refinement satisfying the axioms, Theorem 5.1 establishes that a solution of a game must be a component of its undominated equilibria, and that the component must be essential. Because payoffs are assumed to be generic, all equilibria in the component have the same paths of equilibrium play and thus the same distribution of outcomes. Therefore the main implication for equilibrium refinements is that a predicted outcome distribution should result from equilibria in an essential component of undominated equilibria, and in particular, from the sequential equilibria it necessarily contains. A secondary implication is that after deviations from equilibrium play, the continuations of all equilibria in the component remain admissible and sequentially rational, where those that are not sequential equilibria of the original game are justified by beliefs induced by quasiperfect equilibria of corresponding metagames that embed the given game. This provides one resolution of the conundrum posed by Reny [25, 26, 27]. AXIOMATIC EQUILIBRIUM SELECTION 27 1 2 1 a b 1 2 c e 1 — 2: ac aef bd bgh ab a a b b cdeg c e d g cdfh c f d h d f g h Figure 1. A game tree and its pure reduced normal form in which each pure strategy is identified by the terminal nodes it does not exclude. Appendix A. Enabling Strategies In the normal-form representation of a game in extensive form, a player’s pure strategy specifies the action chosen at each of his information sets in the game tree. However, outcomes are not affected by a strategy’s actions at information sets excluded by his previous actions. One therefore considers equivalence classes of pure strategies. Say that two pure strategies are outcome equivalent if the sets of terminal nodes they do not exclude are the same. For instance, the game in Figure 1 is shown on the left side in extensive form and on the right side in the ‘pure reduced normal form’ (PRNF) introduced by Mailath, Samuelson, and Swinkels [18]. In the PRNF, each outcome-equivalent class of player 1’s pure strategies is identified by the terminal nodes it does not exclude, as indicated by labels of rows along the left side; and each equivalence class of player 2’s pure strategies is identified by the terminal nodes it does not exclude, as indicated by labels of columns along the top. Because this game has no moves by Nature, each row and column determine a unique outcome that is the intersection of the row and column labels, shown as the corresponding entry in the matrix. A similar example is shown in Figure 2 for the game tree of a signaling game. In this case, each profile of pure strategies determines a pair of outcomes such that the first or second outcome occurs depending on whether Nature’s initial move is up or down. For instance, the outcome of 1’s strategy abcd and 2’s strategy aceg is a with probability p and c with probability 1 − p. Say that a terminal node that is not excluded is an enabled outcome. A pure strategy of a player enables outcome z if it chooses all his actions on the path to z. A player’s mixed strategy randomizes over his pure strategies, whereas a behavioral strategy randomizes over 28 SRIHARI GOVINDAN AND ROBERT WILSON a e 1 b f p 2 2 1−p c d 1 g 1—2: aceg acfh bdeg abcd ac ac bd abgh ag ah bg efcd ec fc ed efgh eg fh eg bdfh bd bh fd fh h Figure 2. The game tree of a signaling game and its pure reduced normal form in which each pure strategy is identified by the terminal nodes it does not exclude. actions at each of his information sets. A strategy of either kind induces a probability distribution over outcome-equivalent classes of his pure strategies, and thus a distribution over enabled outcomes. Such a distribution is called an enabling strategy. A point pn ∈ [0, 1]Z is an enabling strategy for player n if it is the distribution over enabled outcomes induced by some mixed or behavioral strategy, i.e. pn (z) is the mixed strategy’s probability of those pure strategies that enable outcome z. The vertices of the polyhedron Pn of n’s enabling strategies correspond to outcome-equivalent classes of n’s pure strategies in the PRNF, as in Figures 1 and 2. Enabling strategies are minimal representations of strategic behavior in games with perfect recall.9 Let p∗ (z) be the probability that Nature’s strategy enables outcome z, which is 1 if Nature Q has no moves. Then for each profile p ∈ P = n Pn of players’ enabling strategies, the Q probability that outcome z results is γz (p) = p∗ (z) n pn (z), because Nature and the players randomize independently. The extensive form is therefore summarized by the multilinear function γ : P → ∆(Z) ⊂ RZ that assigns to each profile of players’ enabling strategies a distribution over terminal nodes, including the effect of Nature’s enabling strategy. Player P n’s expected payoff is Gn (p) = z γz (p)un (z). The game Γ is therefore summarized by the multilinear function G : P → RN that assigns to each profile of players’ enabling strategies their expected payoffs. This summary specification is called the enabling form of the game. Appendix B. Proofs of Propositions B.1. Proof of Proposition 3.3. A multilinear map fn : Σ̃n × Σo → Σn is completely specified by its values at profiles of pure strategies. We use fˆn to denote the restriction of fn to the set S̃n × So of profiles of pure strategies. 9Mertens [21, p. 554] introduces the mapping of mixed strategies to induced distributions on terminal nodes. Koller and Megiddo [15] call them realization plans. AXIOMATIC EQUILIBRIUM SELECTION 29 Proposition B.1. G̃ embeds G via a collection of multilinear maps f = (fn )n∈N if and only if for each player n there exists T̃n ⊆ S̃n and a bijection πn : T̃n → Sn such that for each (s̃, so ) ∈ S̃ × So and t̃n ∈ T̃n : (1) fˆn (t̃n , so ) = πn (t̃n ), (2) G̃n (s̃, so ) = Gn (fˆ(s̃, so )), where fˆ = (fˆn )n∈N . Proof. Suppose we have a game G̃ : Σ̃ × Σo → RN ∪ o and a collection of multilinear maps fn : Σ̃n × Σo → Σn , one for each n ∈ N , such that conditions (1) and (2) of the proposition are satisfied. Then, by condition (1) and multilinearity of fn for each n, for each fixed σo , fn (·, σo ) is surjective because it maps the face spanned by T̃n homeomorphically onto Σn . Also, condition (2) and multilinearity of each fn imply that G̃ = G ◦ f . According to Definition 3.2, therefore, (G̃, f ) embeds G. Now suppose that (G̃, f ) embeds G. Let σo be a profile of completely mixed strategies for outsiders. Because fn is multilinear it induces a linear mapping fn (·, σo ) from Σ̃n to Σn that, by the definition of an embedding, is surjective. Hence, for each sn ∈ Sn there exists a pure strategy t̃n (sn ) in S̃n that is mapped to sn by this linear map. We claim that fn (t̃n (sn ), so ) = P sn for all so ∈ So . Indeed, observe that fn (t̃n (sn ), σo ) = so fn (t̃n (sn ), so )σo (so ), where for each so , σo (so ) is the probability of so under σo . Therefore, since σo is completely mixed, if fn (t̃n (sn ), so ) 6= sn for some so then fn (t̃n (sn ), σo ), which is an average of values at vertices of So , cannot be sn . Thus, fn (t̃n (sn ), so ) = sn for all so . Let T̃n ⊂ S̃n be a collection comprising a different pure strategy t̃n (sn ) for each sn ∈ Sn and let πn be the associated bijection. Define fˆn : S̃n × So → Σn by fˆn (s̃n , so ) = fn (s̃n , so ). Then conditions (1) and (2) of the proposition are satisfied. ¤ B.2. Proof of Proposition 3.4. Proposition B.2. If (G̃, f ) embeds G then the equilibria of G are the f -images of the equilibria of G̃. Proof. Suppose (σ̃, σo ) is an equilibrium of G̃ and let σ = f (σ̃, σo ). For any insider n and his strategy τn ∈ Σn there exists τ̃n ∈ Σ̃n such that fn (τ̃n , σo ) = τn because fn (·, σo ) is surjective by condition (a) of Definition 3.2 an embedding. Using condition (b), Gn (τn , σ−n ) = Gn (f (τ̃n , σ̃−n , σo )) = G̃n (τ̃n , σ̃−n , σo ) 6 G̃n (σ̃, σo ) = Gn (f (σ̃, σo )) = Gn (σ) , where the inequality obtains because (σ̃, σo ) is an equilibrium of G̃. Hence σ is an equilibrium of G. 30 SRIHARI GOVINDAN AND ROBERT WILSON Conversely, suppose σ is an equilibrium of G. For each n, let πn be the bijection given by Proposition 3.3. Let σ̃n be the strategy for insider n in G̃ defined by σ̃n (t̃n ) = σn (πn (t̃n )) for t̃n ∈ T̃n and σ̃n (s̃n ) = 0 for s̃n ∈ / T̃n . Since fn is multilinear, by condition (1) of Proposition 3.3, fn (σ̃n , ·) = σn and thus f (σ̃, ·) = σ. Hence, it suffices to show that there exists a strategy profile σo for outsiders such that (σ̃, σo ) is an equilibrium of G̃. By fixing the profile of insiders’ strategies to be σ̃ one induces a game among outsiders. Let σo be an equilibrium of this induced game among outsiders. To see that (σ̃, σo ) is an equilibrium of G̃, observe that for each pure strategy s̃n of an insider n: G̃n (s̃n , σ̃−n , σo ) = Gn (fn (s̃n , σo ), σ−n ) 6 Gn (σ) = Gn (f (σ̃, σo )) = G̃n (σ̃, σo ) , where the first and second equalities use the property f (σ̃, ·) = σ established above, and the inequality obtains because σ is an equilibrium of G. ¤ Appendix C. Proof of Theorem 4.1 Theorem C.1. (Q, ∂Q) is a pseudomanifold of the same dimension as (P, ∂P). Moreover, ψ : (Q, ∂Q) → (P, ∂P) is an essential map iff Q∗ is stable. Proof. The proof invokes genericity of payoffs by assuming that certain points and polyhedra, identified as they arise during the proof, are in general position. See Appendix D for elaboration of the character of these genericity requirements. For any set X, we write d(X) for its dimension. For any subset Tn of Sn , let Pn (Tn ) be the convex hull (in Pn ) of the strategies in Tn . For simplicity, we write d(Tn ) for d(Pn (Tn )). See Sections 4.4 and 4.5 for additional notation used below. For any vertex πn1 of Π1n , let Sn1 (πn1 ) be the set of pure strategies that map to πn1 . For any face Ψ1n of Π1n , let Sn1 (Ψ1n ) be the set of strategies s1n such that π̄n1 (s1n ) ∈ Ψ1n . Let Hn0 be the set of information sets hn ∈ Hn \ Hn∗ of player n such that at the last information set h′n ∈ Hn∗ that precedes hn , the action there leading to hn belongs to A∗n , which is the set of his equilibrium actions. If a subset Tn0 of Sn0 is such that Pn (Tn0 ) contains an equilibrium in Q̄∗n , then for each first information set hn ∈ Hn0 there exists a strategy in Tn0 that enables hn . Let Hn1 = Hn \ (Hn∗ ∪ Hn0 ). Lemma C.2. Suppose Tn0 is a subset of strategies in Sn0 such that Pn (Tn0 ) contains an equilibrium qn∗ in Q̄∗n . If the strategies in Tn0 are at least as good replies against p0m ∈ Pm0 as other strategies in Sn0 , then all the strategies in Sn0 are equally good replies against p0m . ∗ Proof. Let Π∗n be the projection of Pn0 to RZ , where Z ∗ is the set of terminal nodes reached 0 . Consequently, the with positive probability under the equilibria in Q̄∗ . Then Z ∗ = Zn0 ∩ Zm AXIOMATIC EQUILIBRIUM SELECTION 31 payoff to a strategy s0n against p0m depends on s0n only through its projection to Π∗n . Since the projection of qn∗ —which is at least as good a reply as the other strategies in Sn0 against p0m —belongs to the relative interior of Π∗n , every strategy in Sn0 is a best reply against p0m . ¤ Lemma C.3. Suppose that Tn0 is a subset of Sn0 such that Pn (Tn0 ) is a face of Pn0 containing an equilibrium strategy in Q̄∗n . For each tn ∈ / Tn0 there exists an information set hn ∈ Hn0 that tn enables and where the action chosen by it is avoided by all t0n ∈ Tn0 that enable hn . Proof. Since Pn (Tn0 ) is a face of Pn0 , which is itself a face of Pn , there exists a linear function f : RZ → R that is zero on Pn (Tn0 ) and negative everywhere else on Pn . Fix pm in the interior of Pm and define a payoff function ũn for n by the equation: p0 (z)pm (z)ũn (z) = f (ez ) where ez is the z-th unit vector in RZ . Then when n’s payoff function is ũn , the strategies in Pn (Tn0 ) are the best replies against pm . Take tn ∈ / Tn0 . Since it is suboptimal against pm , there exists an information set hn where the action a chosen by tn is suboptimal. Then hn does not belong to Hn∗ : indeed, since tn belongs to Sn0 , a is an equilibrium action if hn ∈ Hn∗ ; and since Pn (Tn0 ) contains an equilibrium strategy, there exists a strategy in Tn0 that enables such an hn and chooses a. Let h′n be the last information set preceding hn that belongs to Hn∗ . Then obviously the action chosen by tn at h′n is an equilibrium action and thus hn belongs to Hn0 . Any strategy in Tn0 that enables hn avoids a, which proves the lemma. ¤ For the next lemma, it is worth recapitulating the exact definition of the set Π1n . Recall from Section 4.5 that we fix a completely mixed enabling strategy p̄m for player m and compute for each pn the total probability η(pn ) of reaching a terminal node in Zn1 under 1 1 (p̄m , pn ). Hn is a hyperplane in RZn that separates the projection ψZn1 (Pn1 ) of Pn1 to RZn 1 from the origin of RZn and that has (p0 (z)p̄m (z))z∈Zn1 as its normal and some ε > 0 as its constant. The function π̄n1 maps each point pn ∈ Pn \ Pn0 to ε(η(pn ))−1 ψZn1 (pn ) ∈ Hn . Lemma C.4. For a strategy sn of Sn1 , π̄n1 (sn ) is a vertex of Π1n iff there exists a unique information set hn ∈ Hn∗ with the property that sn enables hn and chooses a non-equilibrium action there. Proof. Let sn ∈ Sn1 be a pure strategy satisfying the condition of the lemma. We prove by contradiction that π̄n1 (sn ) is a vertex of Π1n . Therefore suppose to the contrary that π̄n1 (sn ), which equals ε(η(pn ))−1 ψZn1 (pn ), is not a vertex of Π1n . We can express π̄n1 (sn ) as a unique P convex combination Jj=1 λj πn1,j where J > 1 and for each 1 6 j 6 J, πn1,j is a vertex of −1 1,j 1 (s Π1n . For each j, since πn1,j is a vertex of Π1n , we can express πn1,j as ε(η(s1,j n ) for n )) ψZn ′ j 1,j 1 some s1,j n in Sn . Since λ > 0, sn cannot choose a non-equilibrium action at any hn 6= hn 32 SRIHARI GOVINDAN AND ROBERT WILSON in Hn∗ that it enables; since it belongs to Sn1,j it must therefore enable hn ; and it cannot choose a different non-equilibrium action from sn at hn . Observe now that the probability η(pn ) and η(s1,j n ) for all j are equal and exactly the probability that Nature and the strategy P 1,j 1,j p̄m do not exclude hn . Therefore, ψZn1 (pn ) = j λj ψZn1 (s1,j n ). Modify sn to a strategy tn so that at every information set other than the successors to hn , t1,j n agrees with sn , and at the successors to hn it agrees with s1,j n . It is now clear that when viewed as enabling P j 1,j strategies, sn = j λ tn and thus sn is a convex combination of the strategies t1,j n . But that is a contradiction since sn is a vertex of Pn and all the strategies t1,j n are different from one 1,j another and from sn as they induce the points πn that are different from one another and from π̄n1 (sn ) in Hn . To prove the other way around, suppose sn is a strategy that, at a collection hkn for k = 1, . . . , K of at least two information sets in Hn∗ , chooses a non-equilibrium action. For each k, k 1 choose a strategy s1,k n in Sn that enables hn , agrees with sn there and at all its successors, but P at other hn ∈ Hn∗ , chooses an equilibrium action. Then ψZn1 (sn ) = k ψZn1 (s1,k n ). Therefore, π̄n1 (sn ) cannot be a vertex of Π1n . ¤ For each Tn0 ⊆ Sn0 such that Pn (Tn0 ) is a face of Pn0 and contains an equilibrium strategy for n, let Sn1 (Tn0 ) be the subset of Sn1 consisting of strategies s1n such that for each first information set h0n in Hn0 that s1n does not exclude, s1n agrees with some t0n ∈ Tn0 at h0n and all its successors, where t0n has the property that it does not exclude h0n . For a face Ψ1n of Π1n , let Sn1 (Tn0 ; Ψ1n ) = Sn1 (Ψ1n ) ∩ Sn1 (Tn0 ) and Tn ≡ Tn0 ∪ Sn1 (Tn0 ; Ψ1n ). For notational simplicity, we refer to Sn1 (Tn0 ; Ψ1n ) as Tn1 . The following lemma provides an important feature of the set Tn . Lemma C.5. The strategies in Tn are the vertices of a face of Pn whose dimension is d(Tn ) ≡ d(Tn0 ) + d(Ψ1n ) + 1. Proof. Let T̂n1 be the set of strategies t1n in Tn1 such π̄n1 (t1n ) is a vertex of Ψ1n . We will first show that every tn ∈ Tn \ (Tn0 ∪ T̂n1 ) is affinely dependent on the strategies in Tn0 ∪ T̂n1 . Let ∗ t1n ∈ Tn \ (Tn0 ∪ T̂n1 ). By Lemma C.4, there exist information sets h1n , . . . hK n , K > 1, in Hn such that for each k, t1n chooses a non-equilibrium action ak at hkn , and at each other hn ∈ Hn∗ it chooses an equilibrium action. Fix t0n ∈ Tn0 that agrees with t1n everywhere except at the 1 information sets hkn for each k. For each k let t1,k n be the strategy that agrees with tn at 1 hkn and its successors, but everywhere else agrees with t0n . Each t1,k n belongs to T̂n and also, P 0 1 1 0 t1n = k t1,k n − (K − 1)tn . Thus, tn is an affine combination of the strategies in Tn ∪ T̂n . AXIOMATIC EQUILIBRIUM SELECTION 33 1 1,j 0 1 For each j = 0, . . . , d(Ψ1n ), pick a strategy t1,j n ∈ Sn (Tn ) such that π̄n (tn ) is a vertex of Ψ1n . Let T̃n1 be the collection of these strategies. We show that strategies in T̂n1 \ T̃n1 are now affinely dependent on the strategies in Tn0 ∪ T̃n1 . Fix t1n ∈ T̂n1 \ T̃n1 . By Lemma C.4 there exists a unique information set hn ∈ Hn∗ enabled by t1n and where it chooses a non-equilibrium action. J 1 1 1 By construction of T̃n1 , there exists a subset (t1,j n )j=1 of T̃n such that π̄n (tn ) is expressible P j 1,j as an affine combination j λj π̄(t1,j n ) with λ 6= 0 for all j. For each j, sn enables hn and chooses a at hn ; at all other information sets in Hn∗ it chooses an equilibrium action. Let t0n be a strategy that agrees with t1n everywhere except at hn and its successors. For each j, let 1,j t0,j n be a strategy that agrees with tn everywhere except at hn and its successors, where it P 0,j agrees with t0n . Then t1n = t0n + j λj (t1,j n − tn ) and is affinely dependent on the strategies in Tn0 ∪ T̃n1 . It follows now from the above arguments that the affine space A spanned by Tn0 ∪ T̃n1 contains Pn (Tn ) and that the dimension of Pn (Tn ) is as stated. To finish the proof of the lemma, we show that Pn (Tn ) is a face of Pn . Let Qn be the smallest face of Pn that contains Pn (Tn ). Suppose Qn 6= Pn (Tn ). Then there exists a point pn in the relative of interior of Pn (Tn ) and Qn . Therefore pn can be expressed as a convex combination of the vertices of P j 1,j P 0,i 0 1,j Qn in two different ways: (a) i λi t0,i n + j λ tn , where the tn ’s are in Tn and the tn ’s P j 1,j P k 2,k P 2,k are in Tn1 ; (b) i µi t0,i n + k µ tn , where now the tn ’s are the vertices of Qn j µ tn + 0 0 that are not in Tn . Consider one of the t2,k n ’s. If it belongs to Sn \ Tn then by Lemma C.3 there is an information set hn in Hn0 that is enabled by t2,k n where the action chosen by hn is avoided all strategies in Tn0 that enable it: that implies under the expression in (b) that the nodes following this action are assigned a positive probability, but not under (a), which 1 1 1 1 is impossible. If t2,k n belongs to Sn \ Tn then it must belong to Sn (Ψn ) since otherwise under / Tn1 there exists an information set hn ∈ Hn0 enabled by t2,k / Ψ1n . Since sn ∈ (b) π̄n1 (pn ) ∈ n 0 0 2,k where the continuation strategy of tn coincides with that of some tn ∈ Sn \ Tn but not for any sn ∈ Tn0 . As in the previous case, this too is impossible. ¤ One corollary of the above result obtains when we take Tn1 to be Sn1 and Ψ1n to be Π1n . The dimension of Pn is d(Pn0 ) + d(Π1n ) + 1. Observe that Pn1 is a face of Pn iff ψZn1 (Pn1 ) is a face of ψZn1 (Pn ), which is equivalent to saying that Π1n is homeomorphic to ψZn1 (Pn1 ). Thus, the dimension of Pn1 equals d(Π1n ) if Pn1 is a face of Pn and otherwise it equals the dimension of Pn . Fix T ≡ ((T10 , Ψ11 ), (T20 , Ψ12 )), where for each n, Pn (Tn )∩Q∗n nonempty and Ψ1n is a (possibly empty) face of Π1n . Let An (T ) be the set of points in Pn (Tn0 ) such that the strategies in Tm 34 SRIHARI GOVINDAN AND ROBERT WILSON are all best replies. Let T̃n0 be the unique subset of Tn0 such that the interior of An (T ) is contained in the interior of Pn (T̃ 0 ). Let d˜∗ (T ) be the dimension of An (T ). n n Lemma C.6. If Tm1 is nonempty then An (T ) is a proper face of Q̄∗n ∩ Pn (T̃n0 ). Proof. Since Q∗n ∩Pn (Tn ) is nonempty, the strategies in Tn0 are undominated. Hence, Pn (T̃n0 )∩ Q̄∗n = Pn (T̃n0 ) ∩ Q∗n ≡ A∗n (T ). An (T ) is thus a face of A∗n (T ). There remains to show that it is a proper face. This follows from the genericity of payoffs. Fix t1m ∈ Tm1 . There exists ∗ an information set hm ∈ Hm where it chooses a non-equilibrium action a. If the path from each (x, a), for x ∈ hm that is reached under the equilibrium outcome, does not pass through an information set hn ∈ Hn0 , then a would be suboptimal against every equilibrium Q̄∗n . ∗ Thus, there exists a first information set hn ∈ Hn0 and nodes x ∈ Hm and y ∈ hn such that (x, a) ≺ y and x is reached under the equilibrium outcome. Because A∗n (T ) is nonempty, there is a strategy t0n ∈ T̃n0 that enables hn . Clearly, there must be multiple such strategies, again by genericity. Perturbing the probabilities of the terminal nodes following y does not affect the payoffs to strategies in Tm0 but they affect the payoff to t1m . Hence An (T ) is a proper face of A∗n (T ). ¤ 1 (Tm0 ) \ Tm1 such that: (i) sm is an equally good Let T̂m1 be the set of strategies sm in Sm reply against every point in Pn0 to which the strategies in Tm are equally good replies. Let 1 T̄m1 be the set of strategies sm in Sm (Tm0 ) \ (T̂m1 ∪ Tm1 ) that are best replies against every point in An (T ). Let Bn∗ (T ) be the closure of the points in the interior of Pn0 against which the strategies in Tm are equally good replies and at least as good replies as strategies in T̄m1 . Let d∗n (T ) be the dimension of Bn∗ (T ). If Tm1 is empty, let Bn (T ) = Pn0 . Otherwise, let Bn (T ) be the set of points in Pn0 that are of the form λqn0 + (1 − λ)rn0 such that λ > 1, qn0 ∈ Bn∗ (T ), and rn0 ∈ Pn (Tn0 ). Lemma C.7. Suppose Tm1 is nonempty. Bn (T ) is a polyhedron of dimension d∗n (T )+dn (Tn0 )− d˜∗n (T ). Each maximal face Bn′ (T ) of Bn (T ) satisfies exactly one of the following: (1) The relative interior of Bn′ (T ) is contained in the relative interior of a maximal proper face of Pn0 . (2) There exists a strategy rm ∈ T̄m1 such that for each p0n ∈ Bn′ (T ), rm is an equally good ∗ (T ); moreover, in this reply against every point of the form λp0n + (1 − λ)rn0 in Bm 1 1 , if rm is a best reply against case, letting Řm be the set of such rm , for any rm ∈ R̄m 1 is also a best reply against this point. a point in Bn∗ (T ) then every point in Řm AXIOMATIC EQUILIBRIUM SELECTION 35 (3) There exists a maximal proper face of Pn (Tn0 ), say Pn (Rn0 ), such that for each qn ∈ Bn∗ (T ), rn ∈ Pn (Tn0 ) and λ > 1, if λqn + (1 − λ)rn belongs to Bn′ (T ), then rn belongs to Pn (Rn0 ); moreover, An (T ) is contained in Pn (Rn0 ). Proof. Let P̃n0 , P̃n (Tn0 ), and B̃n∗ (T ) be the convex cones spanned by Pn0 , Pn (Tn0 ), and Bn∗ (T ) respectively. Let ξ : Pn0 × P̃n (Tn0 ) → P̃n0 be the function ξ(p0n , r̃n0 ) = p0n + r̃n0 . Then for each p̃0n , ξ −1 (p̃0n ) is a set of dimension d(Tn0 ). Hence the dimension of B̂(T ) ≡ ξ −1 (B̃n∗ (T )) is d(Tn0 ) + d∗n (T ) + 1. Obviously B̂(T ) is a polyhedron. For each face B̂n′ (T ) of B̂n (T ) and for all points (p0n , r̃n0 ) ∈ B̂n′ (T ) at least one of the following holds: (i) p0n belongs to the boundary of Pn0 ; (ii) there exists a strategy tm ∈ T̄m1 such that ξ(p0n , r̃n0 ) belongs to the convex cone spanned by the face of Bn∗ (T ) where this strategy tm is an equally good reply; (iii) r̃n0 belongs to the boundary of P̃n (Tn0 ). Observe now that Bn (T ) is the projection of B̂n (T ) onto the first factor. Obviously it is a polyhedron. For each p0n ∈ Bn (T ) and each r̃n0 such that (p0n , r̃n0 ) ∈ B̂n (T ), (p0n , r̃n0 + λrn0 ) ∈ B̂n (T ) for all rn0 ∈ Bn∗ (T ) ∩ Pn (Tn0 ) and λ > 0. If the set Bn∗ (T ) is in generic position (i.e. if the payoffs are in generic position), then all points in B̂n (T ) that project to p0n are of this form. Since the set Bn∗ (T )∩Pn (Tn0 ) is the intersection of Pn (Tn0 ) with the affine space spanned by An (T ), the dimension of Bn (T ) is as asserted. The enumerated properties of Bn (T ) now follow directly from the corresponding points above; only the last part of property (iii) needs a proof. Suppose rn0 belongs to a proper face Pn (Rn0 ) of Pn (Tn0 ) and An (T ) is not contained in Pn (Rn0 ). If (p0n , λrn0 ) belongs to B̂n (T ), then so does (p0n , λrn0 + r̄n0 ) for r̄n0 ∈ An (T ) \ Pn (Rn0 ) and λrn0 + r̄n0 does not belong to the convex cone generated by Pn (Rn0 ). ¤ Let Cn∗ (T ) be the closure of the set of qn in the interior of Pn such that the strategies in Tm are all equally good replies and at least as good as strategies in T̂m1 and Sn0 \ Tn0 . By Lemma C.5, the dimension of the face spanned by Tm is d(Tm0 ) + d(Ψ1m ) + 1. By genericity of payoffs, the dimension of Cn∗ (T ) is therefore d(Pn ) − d(Tm0 ) − d(Ψ1m ) − 1. Let Cn (T ) be the set of (p1n , πn1 ) ∈ Pn1 × Π1n such that there exist p0n ∈ Pn0 , p2n ∈ Pn (Tn1 ), P P P qn ∈ Cn∗ (T ), and µ ∈ R3+ , such that i µi pin ∈ Cn∗ (T ), i µi = 1, µ1 > 0, and π̄n1 ( i µi pin ) = πn1 . Lemma C.8. The set Cn (T ) is a polyhedron of dimension d(Pn0 ) + d(Pn1 ) + dn (Ψ1n ) − d(Tm0 ) − d(Ψ1m ) − d∗n (T ). On each maximal proper face C ′ of Cn (T ), exactly one of the following holds P for all (p1n , πn1 ) in C ′ . If qn ∈ C ∗ (T ) is of the form i µi pin for some p0n ∈ Pn0 and p2n ∈ Pn (Tn1 ), P and π̄n1 ( i µi pin ) = πn1 , then: (1) p1n belongs to a maximal proper face of Pn1 ; 36 SRIHARI GOVINDAN AND ROBERT WILSON i \ Tmi , which actually belongs to (2) for i = 0 or i = 1, but not both, there exists sm ∈ Sm T̂mi if i = 1, that is as good a reply as points in Tm against qn ; moreover, in this case, if i = 0, Tm0 and sm span a face of Pm0 of which Pm (Tm0 ) is a maximal proper face; and if i = 1, Ψ1m and π̄n1 (sm ) span a face of Π1m of which Ψ1m is a maximal proper face. (3) there exists a maximal proper face Ψ′ of Ψn such that π̄n1 (p2n ) belongs to Ψ′ . Proof. We show that Cn (T ) is a polyhedron of the stated dimension. Since the construction is similar to that in the previous lemma, the enumerated properties can be proved just as before. Let Pn2 be the convex hull of Pn0 and Pn (Tn1 ). Using Lemma C.5, the dimension of Pn2 is d(Pn0 ) + d(Ψ1n ) + 1. Let P̃n2 and P̃n be the convex cones spanned by Pn2 and Pn , respectively. Define ξ : Pn1 × P̃n2 → P̃n by ξ(p1n , p̃2n ) = p1n + p̃2n . Then for each p̃n in the interior of P̃n , ξ −1 (p̃n ) is a set of dimension d(Pn1 )+d(Pn0 )+d(Ψ1n )+1. Letting C̃n∗ (T ) be the convex cone spanned by Cn∗ (T ), the dimension of Ĉn (T ) ≡ ξ −1 (C̃n∗ (T )) is d(Pn0 ) + d(Pn1 ) + d(Ψ1n ) + 1 − d(Tm0 ) − d(Ψ1m ). The function π̄n1 extends to P̃ \ { 0 }. Cn (T ) is the image of Ĉn (T ) under the function χ : Pn1 × P̃n2 given by χ(p1n , p̃2n ) = (p1n , π̄n1 (p1n + p̃2n )) and is thus a polyhedron. As will be shown in the course of the proof of the next lemma, Cn∗ (T ) ∩ Pn (Tn1 ) ⊂ Pn0 . Therefore, Cn∗ (T ) ∩ Pn2 is the intersection of Pn0 with the affine space spanned by Bn∗ (T ). For each (p1n , p̃2n ) ∈ Ĉn (T ), the point (p1n , p̃2n +µqn0 ) belongs to Ĉn (T ) for all µ > 0 and qn0 ∈ Cn∗ (T )∩Pn2 , and has the same image under χ as (p1n , p̃2n ). Moreover, if Cn∗ (T ) is in general position then for each (p1n , πn1 ) in Cn (T ) every point in its inverse image under χ is expressible in this form. Therefore, the dimension of Cn (T ) is as given. ¤ Let T be the collection of T ’s such that An (T ), Bn (T ) and Cn (T ) are nonempty for each n. For each T ∈ T , let Qn (T ) = An (T ) × Bn (T ) × Cn (T ) for each n and let Q(T ) = Q1 (T ) × Q2 (T ). Lemma C.9. (q ∗ , p0 , p1 , π 1 ) belongs to Q iff it belongs to Q(T ) for some T ∈ T . Proof. Suppose for each n that qn∗ ∈ An (T ), p0n ∈ Bn (T ), (p1n , πn1 ) ∈ Cn (T ) for some T . Choose rn0 ∈ Pn (Tn0 ) and λ0n such that qn0 ≡ (1 − λ0n )p0n + λ0n rn0 ∈ Bn∗ (T ). Also, fix p̃0n , rn2 ∈ Pn (Tn1 ), µ0n , µ1n , µ2n such that qn1 ≡ µ0n p̃0n + µ1n p1n + µ2n rn2 belongs to Cn∗ (T ). Fix points q̄n0 and q̄n1 in the interior of An (T ) and Bn (T ) for each n and consider for each 0 < ε < 1, the LPS (q ∗ , q̃ 0 (ε), q̃ 1 (ε)) where for each n, q̃n0 (ε) = (1−ε)q̄n0 +εqn0 ; and q̃n1 (ε) = (1−ε)q̄n0 +εq̄n1 (ε)+ε2 qn1 . ∗ 0 1 The strategies in Tn are equally good replies to qm , q̃m (ε), q̃m (ε) for all ε. We show that these strategies are lexicographic best replies to (q ∗ , q̃ 0 (ε), q̃ 1 (ε)) for all small ε, which proves that (q ∗ , p0 , p1 , π 1 ) belongs to Q. AXIOMATIC EQUILIBRIUM SELECTION 37 ∗ and Observe first that for all ε, a strategy in Sn0 \ Tn0 is an equally good reply against qm 1 0 (ε) by construction (ε) as strategies in Tn0 , by Lemma C.2, and no better a reply against q̃m q̃m 1 1 1 ∗ of Cn (T ). Now for a strategy sn in Sn , consider the strategy tn that agrees with s1n everywhere except that starting at each first information set hn ∈ Hn0 that s1n enables, t1n agrees with some t0n in Tn0 . Since strategies in Tn0 are at least as good as the other strategies in Sn0 , 0 ∗ (ε), q̃n1 (ε)). Observe now that t1n belongs to , q̃m clearly t1n is at least good a reply against (qm ∗ Sn1 (Tn0 ). If it belongs to T̂n1 , then it is an equally good reply as strategies in Tn against qm 1 0 (ε) by definition. If it belongs to T̄n1 then (ε) and no better reply a reply against q̃m and q̃m 0 ∗ (ε) for all ε, and a strictly worse , no better reply against q̃m it is an equally good reply to qm 1 reply against q̃m (ε) for all small ε, again by construction, since it is an inferior reply against q̄n1 which belongs to the interior of Bn∗ (T ). Finally, if it belongs to Sn1 (Tn1 ) \ (T̂n1 ∪ T̄n1 ) then 0 ∗ (ε) for all small ε, since it is and an inferior reply to q̃m it is no better a reply against qm 0 an inferior reply to q̄n by construction of An (T ). Thus the strategies in T are lexicographic best replies to (q ∗ , q̃ 0 (ε), q̃ 1 (ε)) for all small ε. Before proceeding to prove the converse, we use the above argument to show that the intersection of Cn∗ (T ) with the convex hull Pn2 of Pn0 is in fact the intersection F of Pn0 with the affine space spanned by Bn∗ (T ). Take a point qn1 in Cn∗ (T ) ∩ P 2 . If it belongs to Pn0 , then in fact it belongs to F by the definitions of Bn∗ (T ) and T̂m1 . If it does not belong to Pn0 , then it assigns a positive weight to some strategy s1n ∈ Tn1 . The above argument applied when using ∗ is a best reply to (qn∗ , q̃n0 (ε), q̃n1 (ε)) and the strategies in Tn and Sn0 are this qn1 shows that qm ∗ . Observe now that q̃n1 (ε) is a convex combination of strategies in Sn0 and best replies to qm ∗ ) is an equilibrium if ε is Tn1 . Therefore, for all small δ, ((1 − δ − δ 2 )qn∗ + δ q̃n0 (ε) + δ 2 q̃n1 (ε), qm 1 small as well. But these points induce different outcomes because q̃n (ε) has a non-equilibrium strategy, namely one in Tn1 , in its support, which is impossible. Thus, Cn∗ (T ) ∩ Pn2 = F as claimed. Returning to the proof of this lemma, suppose (q ∗ , (p0 , p1 ), π 1 ) belongs to Q. Let qn0 = (1 − λ0n )p0n + λ0n rn0 and let qn1 = µ0n p̃0n + µ1n p1n + µ2n rn2 where (1 − λ0n )qn∗ + λ0n rn0 is a best reply against (q ∗ , q 0 , q 1 ); and rn2 , if µ2n > 0, is a best reply against qn∗ and a weakly better reply against (q ∗ , q 0 , q 1 ) than all the strategies in Pn1 . Let Q0n be the face of Pn0 that contains (1 − λ0n )qn∗ + λ0n rn0 . Let Q1n be the face of Pn1 that contains rn2 in its interior if µ2n > 0. Let Tn0 be the set of strategies tn in Sn0 such that if tn enables a first information set hn ∈ Hn0 then the choices from there on prescribed by t0n coincide with the choices dictated by some vertex of Q0n or Q1n that enables hn . Observe that each t0n ∈ Tn0 is optimal against (q ∗ , q 0 , q 1 ). If µ2n > 0, let Ψ1n be the face of Π1n that contains π̄n1 (rn2 ) in its interior; otherwise let Ψ1n be the empty set. 38 SRIHARI GOVINDAN AND ROBERT WILSON We can now assume without loss of generality that the strategies in Q0n and Q1n are equally 1 0 ∗ ) and hence best replies. Indeed, if for i either 0 or 1, if the , qm , qm good replies against (qm 1 i i strategies in Qn do not yield the same payoff against qm as those in Q0n , modify qm as follows: 0 ∗ in the face Q̃0n of Q0n containing qm pick a point řm in its interior such that the strategies in 0 1 Qn are equally good replies, the strategies in Qn do strictly better than the strategies in Q0n i ∈ [0, 1] such that and at least as well as the other strategies in Sn1 . There exists a unique νm 0 1 i i i i the strategies in Qn and Qn are now equally good replies against (1 − νm )qm řm . Thus, + νm our assumption is without loss of generality. Since the strategies in Q0n and Q1n are best replies. There remains to show that every ∗ 0 1 strategy in Sn (Tn0 ; Ψ1n ) is a best reply against (qm , qm , qm ). Fix sn ∈ Sn (Tn0 ; Ψ1n ). To show that it is a best reply it is sufficient to show that an information set hn that is enabled by sn is enabled by some vertex of either Q0n or Q1n and that this vertex agrees with sn ’s choice an there. Suppose that this hn is in Hn∗ and an ∈ A∗n , or hn belongs to Hn0 ; then obviously some strategy in Q0n or Q1n enables hn and chooses an , by the definition of Tn0 . If hn ∈ Hn∗ and an ∈ / A∗n or hn follows some information set in Hn∗ by the choice of a non-equilibrium action, then some strategy in Q1n enables it and chooses this action, since otherwise sn enables a terminal node that is excluded by all strategies in Q1n , contradicting the assumption that π̄n1 (sn ) ∈ Ψ1n . Thus sn is a best reply and (q ∗ , q 0 , q 1 ) belongs to Q(T ). ¤ Lemma C.10. Q is a pseudomanifold of dimension dˆ ≡ d(P). ˆ By the previous lemma Q = ∪T ∈T Q(T ). Proof. Each Q(T ) is a polyhedron of dimension d. ˆ To show that Q is a pseudomanifold, we establish three Therefore, Q has dimension d. facts for each (q ∗ , (p0 , p1 ), π 1 ) that belongs to some Q(T ): (1) if (q ∗ , (p0 , p1 ), π 1 ) belongs to the interior of Q(T ), then it does not belong to the interior of Q(R) for R 6= T ; (2) if (q ∗ , (p0 , p1 ), π 1 ) is a generic point in a maximal proper face Q′ of Q(T ), then it does not belong to Q(R) for any R 6= T if (p0 , p1 ) ∈ ∂P, and it belongs to the boundary of Q(R) for exactly one other R 6= T if (p0 , p1 ) ∈ / ∂P; moreover in the latter case it belongs to the interior of a maximal proper face of this Q(R) as well; (3) given T, R ∈ T , there exists a finite chain T = T (0), . . . , T (k) = R such that for each 0 ≤ j ≤ k − 1, Q(T (j)) ∩ Q(T (j + 1)) is a subset of a maximal proper face of each and has a nonempty interior in this face. Fix T = (T 0 , Ψ1 ) and x = (q ∗ , (p0 , p1 ), π 1 ) ∈ Q(T ). For each n, choose qn0 ≡ (1 − λ0n )p0n + λ0n rn0 in Bn∗ (T ) and qn1 ≡ µ0n p̃0n + µ1n p1n + µ2n rn1 ∈ Cn∗ (T ). We start with (1). Suppose now that x belongs to the interior of Q(R) for some R. We show that R = T . Since x belongs to the interior of Q(T ), we can assume that every strategy 0 0 \ Tm0 . Since sm is an inferior reply \ Tm0 is inferior to qn1 . Let sm be a strategy in Sm in Sm AXIOMATIC EQUILIBRIUM SELECTION 39 1 compared to the strategies in Tn0 , by Lemma C.3 there exists an information set against qm hn ∈ Hn0 that is enabled by sn where the action chosen by sn is suboptimal and different from the action chosen by every tn ∈ Tn0 that enables hn . But the posterior belief over the 1 1 . This implies that can be computed from πm terminal nodes following hn computed from qm 1 1 ′ 1 ′ ) cannot for any qm such that π̄m (qm ) = πm , sn is an inferior strategy. Therefore, (p1m , πm 0 0 0 0 belong to Cm (R) unless Rm ⊆ Tm . Moreover, if Rm ( Tm , then it cannot belong to the 0 are also optimal. Thus, if x belongs to the interior of Cm (R), since strategies in Tm0 \ Rm 0 0 1 interior of Q(R), Rm = Tm for each m. If Ψm is empty, this implies that Rm = Tm . Suppose 1 1 now that Ψ1m is nonempty. Since x is in the interior we can assume that π̄m (p1m ) 6= πm and 1 1 ) can be computed uniquely from p1m that π̄n (rm ) is in the interior of Ψ1m . Observe that π̄n1 (rm 1 1 1 and computing the boundary and πm by taking the line segment from π̄m (p1m ) through πm point of this line. This implies that if x belongs to the interior of Q(R), then Rm = (Tm0 , Ψ1m ). Thus R = T . We turn to point (2). Suppose now x belongs to the relative interior of a maximal proper face of Q(T ) and that (p0 , p1 ) belongs to ∂P . Then if it belongs to another Q(R) it cannot be in the interior and must belong to the boundary. The arguments of the previous paragraph apply to show that x does not belong to the interior of a maximal face of Q(R). Indeed, 0 it relied on strategies in Sm \ Tm0 being inferior to qn1 , and qn0 (resp. qn1 ) not belonging to the boundary of Bn (T ) (resp. Cn (T )). Thus x must belong to a face of dimension at most dˆ − 2. The set of such points in this maximal proper face of Q(T ) then has dimension at most dˆ − 2, i.e. it is nongeneric. Suppose that x belongs to the relative interior of a maximal proper face of Q(T ) but that (p0 , p1 ) is in the interior in P. Then for exactly one n, just one of the following hold: (2a) qn∗ belongs to the boundary of A∗n (T ); (2b) p0n belongs to the boundary of Bn (T ); (2c) (p1n , πn1 ) belongs to the boundary of Cn (T ). We start with (2c). By the properties we proved for Cn (T ) in Lemma C.8, and since (p0 , p1 ) ∈ / ∂P, either property (ii) or property (iii) of that lemma holds. Under property (ii) x belongs to the boundary of Q(R) where 0 i R = ((Rm , Φ1m ), (Tn , Ψ1n )) is defined as follows. If the strategy rm identified there belongs 0 0 0 i to Sm , then Rm is the vertex set of the face spanned by rm and Tm , while Φ1m = Ψ1n ; if the i 0 strategy rm belongs to Sn1 (Tn0 ; Ψ1n ), then Rm = Tm0 and Φ1m is a face of Π1m that has Ψ1m as 1 1 a maximal face with πm (rm ) ∈ Φ1m \ Ψ1m . Under property (iii) x belongs to the boundary of Q(R) where R = ((Tn0 , Φ1n ), (Tm0 , Ψ1m )) where Φ1n is the maximal proper face of Ψ1n identified there. Suppose x satisfies (2b). Then by the properties we proved for Bn (T ) in Lemma C.7, 1 be the either property (ii) or property (iii) of that lemma holds. Under property (ii) let R̃m 40 SRIHARI GOVINDAN AND ROBERT WILSON set of strategies in T̃m1 that are now best replies against qn0 . Let Φ̃1m be the smallest face of 1 1 1 1 Π1m that contains Ψ1m and the vectors πm (r̃m ) for r̃m ∈ R̃m . Then the strategies in Tm0 and 1 Sm (Φ1m ) are equally good replies against qn0 . Moreover by the genericity of x, if one of these strategies is a best reply against a point in Bn∗ (T ) then all these points are best replies as well. For each face Φ1m of Φ̃1m that has Ψ1m a maximal proper face, choose a strategy rm (Φ1m ) that maps to a vertex of Φ1m that is not contained in Ψ1m . The set of points in Bn∗ (T ) against which the strategies in R̃m are as good replies as Tm has dimension d∗n (T ) − 1. However, the set of points in Cn∗ (T ) where two or more of these strategies rm (Φ1m ) are also best replies has dimension d(Pn ) − d(Tm0 ) − d(Ψ1m ) − 3 or less. Therefore, for R and R′ of the form ((Tn0 , Ψ1n ), (Tm0 , Φ1m )) the set of (p1n , πn1 ) that lies in the intersection Cn (T ) ∩ Cn (R) ∩ Cn (R′ ) is at most d0n + d1n + dn (Ψ1n ) − d∗n (T ) − 1 or less. This implies that generic (p1n , πn1 ) in Cn (T ) belongs to at most one of these sets. Moreover, if x belongs to Q(R), then it belongs to 1 is a convex combination of the boundary of Q(R): indeed the point φ1m in Φ1m such that πm 1 1 1 π̄m (pm ) and φm is uniquely determined, as we argued above; since this point belongs to Ψ1m , which is a face of Φ1m , x indeed belongs to the boundary of Q(R) if it belongs to Q(R). To 1 finish the proof of this case, we now show that x belongs to at least one Q(R). Take an r̃m 1 that yields the highest payoff against qn1 among the strategies in R̃m . If this payoff is higher than the payoff to the strategies in Tm0 , pick a point q̄n0 in the interior of Bn∗ (T ) and replace 1 qn1 with q̃n1 (ε) ≡ (1 − ε)qn1 + εq̄n0 where ε is the unique number where the strategies Tm0 and r̃m 1 1 are equally good replies; then x belongs to some Q(R) that has πm (r̃m ) as an extra vertex. 1 0 0 If the payoff to r̃m is lower, take a point q̄n in Pn against which the strategies in Tm0 are 1 equally good and worse than the strategies in R̃m and repeat the argument to show that x belongs to Q(R). Finally, suppose that x satisfies (2a). Let A′n be a maximal proper face of An (T ) that 1 1 contains qn∗ in its interior. Let Řm be the set of strategies řm in Sm (Tm0 ) \ (Tm1 ∪ T̂m1 ∪ T̄m1 )) 1 that are best replies against all the points in A′n . Observe that Řm is nonempty if qn∗ belongs to the interior of Pn (T̃n0 ). Indeed, in this case, the interior of A′n which is a face of An (T ) is 1 contained in the interior of Pn (T̃n0 ), which implies that some strategy in Sm is now optimal 0 0 0 against every point in this face. Let Rn be the set of subsets Rn of Tn such Pn (Rn0 ) is a maximal proper face of Pn (Tn0 ) and Pn (Rn0 ) ∩ Q∗n = A′n . Observe that R0n is nonempty if qn∗ belongs to the boundary of Pn (T̃n0 ). 1 1 1 (řm ) for řm ∈ Let Φ̌1m be the smallest face of Π1m that contains Ψ1m and the vectors πm 1 Řm . For each face Φ1m of Φ̌1m that has Ψ1m as a maximal proper face, choose a strategy 1 řm (Φ1m ) that maps to a vertex of Φ1m that is not contained in Ψ1m . Take R and R′ of AXIOMATIC EQUILIBRIUM SELECTION 41 the form ((Tn0 , Ψ1n ), (Tm0 , Φ1m )). Since A′n is a face of An (T ), d˜∗n (R) = d˜∗n (R′ ) = d˜∗n (T ) − 1. 1 are inferior replies to points in the interior of An (T ), Bn∗ (R) Since the strategies in Řm if nonempty has dimension d∗n (T ) − 1. Therefore, if Bn (R) 6= Bn (R′ ), their intersection with Bn (T ) has codimension 1 in Bn (T ) and x cannot belong to two of these sets at once. On the other hand, if Bn (R) = Bn (R′ ) then an argument similar to that in the previous paragraph shows that generic (p1n , πn1 ) ∈ Cn (T ) can belong to at most one of these sets, Cn (R) and Cn (R′ ). Hence a generic x belongs to at most one of these sets. Likewise, for R of the form ((Rn0 , Ψ1n ), (Tm0 , Ψ1m )) with Rn0 ∈ R0n and R′ of the form ((Rn′ 0 , Ψ1n ), (Tm0 , Ψ1m )) or ((Tn0 , Ψ1n ), (Tm0 , Φ1m )), the intersection of Bn (T ) with Bn (R) and Bn (R′ ) has codimension at least one. Thus x belongs to at most one set Q(R). To finish the proof of this part, we show that it belongs to at least one such set. 1 Let řm (Φ1m ) be a strategy that is a lexicographic best reply to (qn0 , qn1 ) among the strategies 1 in this class. If řm (Φ1m ) is a lexicographic (weakly) better reply against (qn0 , qn1 ) (or it is a worse reply and qn∗ belongs to the interior of Pn (T̃n0 )) we choose a point q̄n0 in the interior of 1 Pn (T̃n0 ) against which the strategy řm (Φ1m ) is at least as good as the other points in this class and inferior (resp. superior) to strategies in Tm0 . The strategies in Tm0 and Sm (Tm0 ; Φ1m ) are now equally good replies against some average of qn0 and q̄n0 , as well as some average of qn1 and q̄n0 . The point x then belongs to ((Tm0 , Φ1m ), (Tn0 , Ψ1n )). If qn∗ belongs to the boundary of 1 Pn (T̃n0 ), and řm (Φ1m ) is an inferior reply, then Řn0 is nonempty. There exists Rn0 such that qn0 is a convex combination of a point in Pn (Ťn0 ) and p0n . Then x belongs to ((Ťn0 , Ψ1n ), (Tm0 , Ψ1m )). This concludes the proof of (2). We turn now to (3). Given Q(T ) and Q(R) for T = (T 0 , Ψ0 ) 6= R = (R0 , Φ1 ), we will first construct a sequence T = T (1), . . . , T (k) = T̃ where T̃ = ((T̃n0 , ∅), (T̃m0 , ∅)). And, likewise one from R to R̃. Then we will show how to construct a sequence from T̃ to R̃. In case T̃n0 6= Tn0 for some n, let T = T (0), . . . , T (k) be a sequence where for each j > 0, T (j) = ((Tn0 (j), Ψ1n ), (Tm0 (j), Ψ1n )) with Pn (Tn0 (j))×Pm (Tm0 (j)) being a maximal proper face of Pn (Tn0 (j−1))×Pm (Tm0 (j−1)), and Tn (k) = T̃n0 for each n. This sequence generates a sequence of polyhedra Q(T ) = Q(T (0)), . . . , Q(T (k)) where for each j > 0, the intersection of Q(T (j)) with Q(T (j − 1)) is contained in a maximal proper face of each and has a nonempty interior. After this operation we have Q(T̃ (k)), where T̃ (k) = ((T̃n0 , Ψ1n ), (T̃m0 , Ψ1m )). In case Ψ1n is nonempty for some n, let Ψ1 = (Ψ1n (0), Ψ1m (0)), . . . , (Ψ1n (l), Ψ1m (l)) = (∅, ∅) be a sequence such that for each 1 6 j ≤ l, Ψ1m (j)×Ψ1m (j) is a maximal proper face of Ψ1n (j −1)×Ψ1m (j −1), and Ψ1 (l) = ∅. This way we can connect Q(T̃ (k)) with Q(T̃ ) where T̃ = ((T̃n0 , ∅), (T̃m0 , ∅)). Now we show how to connect T̃ with R̃ for two sets T, R in T . Because Q∗n is connected for 42 SRIHARI GOVINDAN AND ROBERT WILSON 0 0 each n, there exists a sequence T̃ 0 = (Sn0 (0), Sm (0)), . . . , (Sn0 (l), Sm (l)) = R̃n0 where for each 0 1 6 j 6 l, either Pn (Sn0 (j)) × Pn (Sm (j)) is either a maximal proper face of Pn (Sn0 (j + 1)) or vice versa and for each n, Q∗n intersects the interior of the set Pn (Sn0 (j)). This generates 0 a sequence Q0 , . . . Qj , where Qj = ((Sn0 (j), ∅), (Sm (j), ∅)). Thus we have constructed a sequence of sets in T that connect T and R. ¤ This concludes the proof of the first statement in the theorem. Now we prove the second part. Lemma C.11. ψ : (Q, ∂Q) → (P, ∂P) is essential iff Q∗ is stable. Proof. Let Y = [0, 1] × P . For each 0 < ε 6 1, let Yε = [0, ε] × P and let ∂Yε be the boundary of Yε . Each (ε, p) ∈ Y defines a strategic game G(ε, p) where the strategy set is P but where the payoff from an enabling strategy profile q is the payoff in G from the profile (1 − ε)q + εp. If q is an equilibrium of G(ε, p), we say that (1 − ε)q + εp is a perturbed equilibrium of G(ε, p). Let E be the closure of the set of (ε, p, q) such that (ε, p) ∈ Y1 \ ∂Y1 and q is a perturbed equilibrium of G(ε, p). Let θ be the projection map from E. For each subset E of E and each 0 < ε, let (Eε , ∂Eε ) be E ∩ θ−1 (Pε , ∂Pε ). In [9] we show that there exists 0 < ε̄ 6 1 and a finite number of subsets E 1 , . . . , E K of E such that for each 0 < ε 6 ε̄: (i) (Eεk , ∂Eεk ) is a pseudomanifold (in fact a homology manifold) of dimension d(P ) + 1 for each k; (ii) Eεk ∩ Eεj ⊂ ∂φ−1 (∂P1 ) for k 6= j; (iii) ∪k Eεk = Eε . In [6] we show that there exists a neighborhood of Σ∗ and an ε > 0 such that the set of ε-perfect equilibria in this neighborhood is connected. The image of this set of ε-perfect equilibria under ρ is therefore connected. Thus there exists some k such that E0k = { 0 }×Q∗ . Q∗ is stable iff the projection θ from E k to Yε̄ is essential (Mertens [19]). For simplicity in notation we refer to this E k as simply E. It is now sufficient to prove that ψ is essential iff the projection θ from E is essential. For each n, P̃n ≡ [0, 1] × Pn and define χn : P̃n → Pn by χn (λn , p0n , p1n ) = (1 − λn )p0n + λn p1n . Then we have that ((P̃n , ∂ P̃n ), (Pn , ∂Pn ), χn ) is a ball-bundle. Let χ be the product map χ1 × χ2 ; then ((P̃ , ∂ P̃ ), (P, ∂P ), χ) is a ball-bundle too. Let Ŷε = [0, ε] × P̃ . Then χ induces a map hY : Ŷ1 → Y1 by h(ε, λ, p0 , p1 ) = (ε, χ(λ, p0 , p1 )). Now ((Ŷε , ∂ Ŷε ), (Yε , ∂Yε ), hY ) is a ball-bundle. Let Ẽ be the set of all ((ε, λ, p0 , p1 ), q) ∈ Ŷ1 × P such that hE (ε, λ, p0 , p1 , q) ≡ (ε, f (λ, p0 , p1 ), q) ∈ E. Then also ((Ẽε , ∂ Ẽε ), (Eε , ∂Eε ), hE ) is a ball-bundle. Moreover, letting θ̃ be the projection from Ẽ to P, we have that hY ◦ θ̃ = θ ◦ hE . Therefore, by the Thom Isomorphism Theorem, θ is essential iff θ̃ is; cf. [20, Appendix IV.3]. AXIOMATIC EQUILIBRIUM SELECTION 43 Let Ê be the closure of the set of (ε, λ, p0 , p1 , q, π 1 (q)) such that (ε, λ, p0 , p1 , q) ∈ Ẽ and λ 6= 0. By the strong excision property, the natural projection φ̂ from Ê to Ẽ induces an isomorphism of their cohomology groups. Let θ̂ be the projection from Ê to Ŷε . Then θ̂ = θ̃ ◦ φ̂. Therefore, θ̃ is essential iff θ̂ is. Let η : Ê → R3+ be the projection map η(ε, λ, (p0 , p1 ), q, π 1 ) = (ε, λ). Let D = η(Ê). By the generic local triviality theorem, there exists a partition of D into a finite number of connected subsets D10 , . . . , Dl0 , and for each Di a semi-algebraic fibre pair (Fi , ∂Fi ), a homeomorphism hi : Di0 × (Fi , ∂Fi ) → (η −1 (Di0 ), η −1 (Di0 ) ∩ ∂ Ê) such that η ◦ hi is the projection from Di0 × Fi to Di0 . There now exists an i, say 1, such that the closure D1 of a subset of Di0 is homeomorphic to a 3-simplex and contains [0, ε̃] × { 0 } for some ε̃ < ε̄. Let (Ŷ (D1 ), ∂ Ŷ (D1 )) ≡ (D1 , ∂D1 ) × (P, ∂P) and let Ê(D1 ) be the closure of the inverse image of D1 × F1 under h1 . Let θ̂(D1 ) : (Ê(D1 ), ∂ Ê(D1 )) → (Ŷ (D1 ), ∂ Ŷ (D1 )) be the projection. We claim that (Ê(D1 ), ∂ Ê(D1 )) is a pseudomanifold. Since Ê is a pseudomanifold, this claim is proved if we show that Ê(D1 ) \ ∂ Ê(D1 ) is path-connected. There exist 0 < ε̂ < ε̃ and integers rn > 1 for each n such that (ε, λ) ∈ D1 if 0 < ε < ε̂ and 0 ≤ λn ≤ εrn −1 . Now given two points x(0) and x(1) in Êε̂ (D1 ) \ ∂ Êε̂ (D1 ), connect them by a curve x(t) = (ε(t), λ(t), p0 (t), p1 (t), q(t), π 1 (q(t))) in Êε̄ \ ∂ Êε̄ as t goes from 0 to 1. For each t, express q(t) as ε(t))(λ(t)p0 (t) + (1 − λ(t))p1 (t)) + (1 − ε(t))r(t) where r(t) is a best reply to q(t). For each t we can now find a point q ∗ (t) ∈ Q∗ such that r(t) is a best reply against q ∗ (t). Choose a positive ε such that ε + εrn < ε̂. For each n, modify xn (t) to the vector x̃n (t) = (ε + εrn ε(t), λ̃(t), p̃0 (t), p1 (t), q̃(t), π 1 (t)) where λ̃n (t) = p̃0n (t) = εrn ε(t) λn (t), ε + ε rn εq ∗ (t) + εrn ε(t)(1 − λ(t))p0 (t) , ε + εrn ε(t)(1 − λ(t)) q̃n (t) = (1 − εrn )q ∗ (t) + εrn (1 − ε(t))r(t) + εrn ε(t)((1 − λ(t))p0 (t) + λ(t)p1 (t)). Then x̃(t) belongs to Ê(D1 ) \ ∂ Ê(D1 ) for all t. Moreover, for t = 0, 1, x(t) and x̃(t) can now be connected by a path x̂(t; s) defined as follows. For s ∈ [0, 1], let kn (s) = min(1, 2s)rn and then: x̂n (t; s) = ((2s − 1)+ ε + εkn (s) ε(t), λ̂(t; s), p0 (t; s), p1 (t), q̂(t; s), π 1 (t)) 44 SRIHARI GOVINDAN AND ROBERT WILSON where λ̂n (t) = p̃0n (t; s) = εkn (s) ε(t)λn (t) , (2s − 1)+ ε + εkn (s) ε(t) (2s − 1)+ εqn∗ (t) + εrn ε(t)(1 − λn (t))p0n (t) , (2s − 1)+ ε + εkn (s) ε(t)(1 − λn (t)) q̃n (t; s) = (1 − εkn (s) )qn∗ (t) + εkn (s) (1 − ε(t))r(t) + εkn (s) ε(t)((1 − λn (t))p0n (t) + λn (t)p1 (t)). Thus Ê(D1 )\∂ Ê(D1 ) is connected and hence a pseudomanifold of dimension d(P)+3. Ŷ (D1 ) is a full-dimensional subset of Yε̄ . Therefore θ̂ is essential iff θ̂(D1 ) is essential. Since (Ê(D1 ), ∂ Ê(D1 )) is a pseudomanifold, (F1 , ∂F1 ) is a pseudomanifold of dimension d(P). Moreover, for each (ε, λ) ∈ D10 , letting (Êε,λ , ∂ Êε,λ ) ≡ h−1 1 ({ ε, λ } × (F1 , ∂F1 )) we have that θ̂ is essential iff θ̂ε,λ , the projection map (Êε,λ , ∂ Êε,λ ) → (P, ∂P ), is essential. Let L be the set of (ε, λ) ∈ D1 such that λn = εr for some r > rn for each n. Let Ê(L) be the closure of the inverse image of L \ { (0, 0) } under h1 . Let ∂ Ê(L) be the inverse image of ∂P under the projection map θ(L) from Ê(L). For each (ε, λ) ∈ L, (Eε,λ , ∂Eε,λ ) is (θ(L))−1 ({ (ε, λ) } × (P, ∂P)). Our next objective is to show that (Q, ∂Q) equals the set of (q ∗ , (p0 , p1 ), π 1 ) such that (0, 0, (p0 , p1 ), q ∗ , π 1 ) belongs to E0,0 (L). Given (0, 0, (p0 , p1 ), q ∗ , π 1 ) in Ê(L) there exists a sequence (ε(k), λ(k), (p0 , p1 )(k), q(k), π 1 (k)) in Ê(L) \ E0,0 converging to it. For each n and k, we can express qn (k) as (1 − ε(k))((1 − µ1n )qn0 (k) + µ1n rn1 (k)) + ε(k)(λ(k)p0n (k) + (1 − λ(k))p1n (k)), where µ1n and λ(k) converge to zero, qn0 (k) belongs to Pn0 and converges to qn∗ , and rn1 (k) belongs to Pn1 . Also qn0 (k) and rn1 (k) if µ1n > 0 are best replies to q(k) for all k. By going to a subsequence, the faces to which qn0 (k) and rn0 (k) belong are constant for all k. The sequence generates for each n an LPS Λn = (q̄n0 , . . . , q̄nl ) where q̄n0 = qn∗ . As in the proof of Theorem 5.1, there exists a level lni for each i = 0, 1 that is expressible as a convex combination of p0n and another strategy. Since λ(k) converges to zero, ln0 < ln1 . As in the proof of Theorem 5.1, we can prove that q̄nl belongs to Q̄∗n for each l < ln0 l0 and if we express q̄nn = νn0 p0n + (1 − νn0 )rn0 , then the strategies q̄nl for l < ln0 and rn0 if ν 0 < 1 l1 are lexicographic best replies against Λm . Likewise, if we express q̄nn as ν̄n0 p̃0n + ν̄n1 p1n + ν̄n2 rn2 , then the strategy rn2 if ν̄n2 > 0 is a lexicographic best reply against Λm . As in the proof of Theorem 5.1, Section 5.6, we can now write down an LPS (q ∗ , q̄ 0 (ε), q̄ 1 (ε)) to show that (q ∗ , (p0 , p1 ), π 1 ) belongs to Q. Given (q ∗ , (p0 , p1 ), π 1 ) in Q, it belongs to Q(T ) for some T . There exist qn0 = (1 − λ0n )p0n + λ0n rn0 and qn1 = µ0n p̃0n + µ1n p1n + µ2n rn2 such that the strategies in Tm are best replies against the r LPS (q ∗ , qn0 , qn1 ). For all small ε, choose α(ε) such that εr µ1 = (εr µ1 + εr µ0 + α(ε)(1 − λ0n )p0n ) , AXIOMATIC EQUILIBRIUM SELECTION 45 where r satisfies the property in the first line of the previous paragraph. Then for all small ε, (ε, λ(ε), (p0 , p1 )(ε), q(ε)) belongs to Ê(L), where: λ(ε) = εr µ1 , εr µ1 + εr µ0 + α(ε)(1 − λ0n ) and (p0 , p1 )(ε) = (p0 (ε), p1 ) with for each n, p0n (ε) = εr µ0 p̃0n + α(ε)(1 − λ0n )p0n εr µ0 + α(ε) and q̂(ε) = (1 − α(ε) − εr )q ∗ + α(ε)q 0 + εr q 1 . For each (ε, λ), (Eε,λ , ∂Eε,λ ) is a pseudomanifold. Indeed for (ε, λ) 6= (0, 0) this follows from the fact that this pair is homeomorphic to (F1 , ∂F1 ), which is a pseudomanifold; for (0, 0), this follows from the fact that (E0,0 , ∂E0,0 ) is homeomorphic to (Q, ∂Q). The inclusion map (Eε,λ , ∂Eε,λ ) induces an isomorphism of the d(P)-th cohomology groups. Thus θ̂0,0 is essential iff θ̂ε,λ is essential for some (and then all) (ε, λ) ∈ L. The projection map θ̂0,0 is just the map ψ. Hence Q∗ is stable iff ψ is essential. ¤ In case Sn1 is empty, the construction is modified as follows. We can omit the sets Cn (T ) and Bm (T ) from the description of Q(T ). In the last lemma above, the vector λ is now just a number, one for player m. The simplex D constructed there is 2-dimensional and contains a curve L of the form λ = εr . The rest of the proof is essentially the same. This concludes the proof of the Theorem. ¤ Appendix D. The Genericity Assumption The conclusions of this paper necessarily hold only when, fixing the game tree, the payoffs lie in a generic set. Here we outline the nature of the genericity that is invoked. First, we require that the game have finitely equilibrium outcomes: in [4] we show that outside a lower-dimensional set every game has finitely many outcomes. Second, the constructions in Appendix C rely on certain polyhedra being in general position. Each of these polyhedra— there are finitely many of them—is a set of enabling strategies for a player n against which, in a certain class of strategies for player m, a subclass is optimal. Since these are defined by linear equations and inequalities in the payoffs of player m, the set of games where the arguments fail is a lower-dimensional set. Third, Lemma C.11 requires a characterization of stable sets that in [9] we show holds for all games outside a lower-dimensional set. 46 SRIHARI GOVINDAN AND ROBERT WILSON References [1] Blume, L., A. Brandenburger, and E. Dekel (1991): Lexicographic Probabilities and Equilibrium Refinements, Econometrica, 59, 81-98. [2] Govindan, S., and T. Klumpp (2002): Perfect Equilibrium and Lexicographic Beliefs, International Journal of Game Theory, 31, 229-243. [3] Govindan, S., and J.-F. Mertens (2003): An Equivalent Definition of Stable Equilibria, International Journal of Game Theory, 3, 339-357. [4] Govindan, S., and R. Wilson (2001): Direct Proofs of Generic Finiteness of Nash Equilibrium Outcomes, Econometrica, 69, 765-769. [5] Govindan, S., and R. Wilson (2002): Structure Theorems for Game Trees, Proceedings of the National Academy of Sciences USA, 99, 9077-9080. URL: www.pnas.org/cgi/reprint/99/13/9077.pdf [6] Govindan, S., and R. Wilson (2002): Maximal Stable Sets of Two-Player Games, International Journal of Game Theory, 30, 557-566. [7] Govindan, S., and R. Wilson (2006): Sufficient Conditions for Stable Equilibria, Theoretical Economics, 1, 167-206. [8] Govindan, S., and R. Wilson (2008): “Nash Equilibrium, refinements of,” in S. Durlauf and L. Blume (eds.), The New Palgrave Dictionary of Economics, 2nd Edition. New York: Palgrave Macmillan. [9] Govindan, S., and R. Wilson (2008): Metastable Equilibria, Mathematics of Operations Research, 33, 787-820. [10] Govindan, S., and R. Wilson (2009): On Forward Induction, Econometrica, 77, 1-28. [11] Govindan, S., and R. Wilson (2009): Axiomatic Theory of Equilibrium Selection in Games with Games with Two Players, Perfect Information, and Generic Payoffs, Research Paper 2008, Stanford Business School, Stanford CA. [12] Hillas, J., and E. Kohlberg (2002): Conceptual Foundations of Strategic Equilibrium, in R. Aumann and S. Hart (eds.), Handbook of Game Theory, III, 1597-1663. New York: Elsevier. [13] Kahneman, D., and A. Tversky, editors (2000): Choices, Values, and Frames. Cambridge, UK: Cambridge University Press. [14] Kohlberg, E., and J.-F. Mertens (1986): On the Strategic Stability of Equilibria, Econometrica, 54, 1003-1037. [15] Koller, D., and N. Megiddo (1992): The Complexity of Two-Person Zero-Sum Games in Extensive Form, Games and Economic Behavior, 4, 528-552. [16] Kreps, D., and R. Wilson (1982): Sequential Equilibria, Econometrica, 50, 863-894. [17] Kuhn, H. (1953): Extensive Games and the Problem of Information, in H. Kuhn and A. Tucker (eds.), Contributions to the Theory of Games, II, 193-216. Princeton: Princeton University Press. Reprinted in H. Kuhn (ed.), Classics in Game Theory, Princeton University Press, Princeton, New Jersey, 1997. [18] Mailath, G., L. Samuelson, and J. Swinkels (1993): Extensive Form Reasoning in Normal Form Games, Econometrica, 61, 273-302. [19] Mertens, J.-F. (1989): Stable Equilibria–A Reformulation, Part I: Definition and Basic Properties, Mathematics of Operations Research, 14, 575-625. [20] Mertens, J.-F. (1991): Stable Equilibria–A Reformulation, Part II: Discussion of the Definition and Further Results, Mathematics of Operations Research, 16, 694-753. [21] Mertens, J.-F. (1992): The Small Worlds Axiom for Stable Equilibria, Games and Economic Behavior, 4, 553-564. AXIOMATIC EQUILIBRIUM SELECTION 47 [22] Nash, J. (1950): Equilibrium Points in N-Person Games, Proceedings of the National Academy of Sciences USA, 36, 48-49. [23] Nash, J. (1951): Non-Cooperative Games, Annals of Mathematics, 54, 286-295. [24] Norde, H., J. Potters, H. Reijnierse, D. Vermeulen (1996): Equilibrium Selection and Consistency, Games and Economic Behavior, 12, 219-225. [25] Reny, P. (1992): Backward Induction, Normal Form Perfection and Explicable Equilibria, Econometrica, 60, 627-649. [26] Reny, P. (1992): Rationality in Extensive-Form Games, Journal of Economic Perspectives, 6, 103-118. [27] Reny, P. (1993): Common Belief and the Theory of Games with Perfect Information, Journal of Economic Theory, 59, 257-274. [28] Savage, L.J. (1954): Foundations of Statistics, New York: John Wiley & Sons. [29] Spanier, E.H. (1966): Algebraic Topology, New York: McGraw-Hill. [30] van Damme, E. (1984): A Relation between Perfect Equilibria in Extensive Form Games and Proper Equilibria in Normal Form Games, International Journal of Game Theory, 13, 1-13. [31] van Damme, E. (2002): Strategic Equilibrium, in R. Aumann and S. Hart (eds.), Handbook of Game Theory, III, 1523-1596. New York: Elsevier. Department of Economics, University of Iowa, Iowa City IA 52242, USA. E-mail address: srihari-govindan@uiowa.edu Stanford Business School, Stanford, CA 94305-5015, USA. E-mail address: rwilson@stanford.edu

Log In

AXIOMATIC EQUILIBRIUM SELECTION FOR GENERIC TWO-PLAYER GAMES

Related papers

Related papers

Related topics