Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2343776.2343821acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
research-article

Strategy purification and thresholding: effective non-equilibrium approaches for playing large games

Published: 04 June 2012 Publication History

Abstract

There has been significant recent interest in computing effective strategies for playing large imperfect-information games. Much prior work involves computing an approximate equilibrium strategy in a smaller abstract game, then playing this strategy in the full game (with the hope that it also well approximates an equilibrium in the full game). In this paper, we present a family of modifications to this approach that work by constructing non-equilibrium strategies in the abstract game, which are then played in the full game. Our new procedures, called purification and thresholding, modify the action probabilities of an abstract equilibrium by preferring the higher-probability actions. Using a variety of domains, we show that these approaches lead to significantly stronger play than the standard equilibrium approach. As one example, our program that uses purification came in first place in the two-player no-limit Texas Hold'em total bankroll division of the 2010 Annual Computer Poker Competition. Surprisingly, we also show that purification significantly improves performance (against the full equilibrium strategy) in random 4 x 4 matrix games using random 3 x 3 abstractions. We present several additional results (both theoretical and empirical). Overall, one can view these approaches as ways of achieving robustness against overfitting one's strategy to one's lossy abstraction. Perhaps surprisingly, the performance gains do not necessarily come at the expense of worst-case exploitability.

References

[1]
D. Billings, N. Burch, A. Davidson, R. Holte, J. Schaeffer, T. Schauenberg, and D. Szafron. Approximating game-theoretic optimal strategies for full-scale poker. In IJCAI, 2003.
[2]
A. Gilpin and T. Sandholm. A competitive Texas Hold'em poker player via automated abstraction and real-time equilibrium computation. In AAAI, 2006.
[3]
A. Gilpin and T. Sandholm. Lossless abstraction of imperfect information games. Journal of the ACM, 54(5), 2007.
[4]
A. Gilpin and T. Sandholm. Expectation-based versus potential-aware automated abstraction in imperfect information games: An experimental comparison using poker. In AAAI, 2008. Short paper.
[5]
A. Gilpin, T. Sandholm, and T. B. Sørensen. A heads-up no-limit Texas Hold'em poker player: Discretized betting models and automatically generated equilibrium-finding programs. In AAMAS, 2008.
[6]
S. Hoda, A. Gilpin, J. Peña, and T. Sandholm. Smoothing techniques for computing Nash equilibria of sequential games. Mathematics of Operations Research, 35(2), 2010.
[7]
M. Johanson, K. Waugh, M. Bowling, and M. Zinkevich. Accelerating best response calculation in large extensive games. In IJCAI, 2011.
[8]
D. Koller, N. Megiddo, and B. von Stengel. Fast algorithms for finding randomized strategies in game trees. In STOC, 1994.
[9]
R. Porter, E. Nudelman, and Y. Shoham. Simple search methods for finding a Nash equilibrium. Games and Economic Behavior, 63(2), 2008.
[10]
D. Schnizlein, M. Bowling, and D. Szafron. Probabilistic state translation in extensive games with large action sets. In IJCAI, 2009.
[11]
J. Shi and M. Littman. Abstraction methods for game theoretic poker. In CG '00: Revised Papers from the International Conference on Computers and Games. Springer-Verlag, 2002.
[12]
K. Waugh, D. Schnizlein, M. Bowling, and D. Szafron. Abstraction pathologies in extensive games. In AAMAS, 2009.
[13]
M. Zinkevich, M. Bowling, M. Johanson, and C. Piccione. Regret minimization in games with incomplete information. In NIPS, 2007.

Cited By

View all
  • (2016)Reflections on the first man vs. machine no-limit Texas hold 'em competitionACM SIGecom Exchanges10.1145/2904104.290410514:2(2-15)Online publication date: 16-Mar-2016
  • (2015)Robust learning for repeated stochastic games via meta-gamingProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832581.2832725(3416-3422)Online publication date: 25-Jul-2015
  • (2015)Endgame Solving in Large Imperfect-Information GamesProceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems10.5555/2772879.2772888(37-45)Online publication date: 4-May-2015
  • Show More Cited By

Index Terms

  1. Strategy purification and thresholding: effective non-equilibrium approaches for playing large games

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      AAMAS '12: Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
      June 2012
      601 pages
      ISBN:0981738125

      Sponsors

      • The International Foundation for Autonomous Agents and Multiagent Systems: The International Foundation for Autonomous Agents and Multiagent Systems

      In-Cooperation

      Publisher

      International Foundation for Autonomous Agents and Multiagent Systems

      Richland, SC

      Publication History

      Published: 04 June 2012

      Check for updates

      Author Tags

      1. abstraction
      2. game playing
      3. game theory
      4. large games
      5. non-equilibrium approaches
      6. purification
      7. reverse mapping

      Qualifiers

      • Research-article

      Conference

      AAMAS 12
      Sponsor:
      • The International Foundation for Autonomous Agents and Multiagent Systems

      Acceptance Rates

      Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 01 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2016)Reflections on the first man vs. machine no-limit Texas hold 'em competitionACM SIGecom Exchanges10.1145/2904104.290410514:2(2-15)Online publication date: 16-Mar-2016
      • (2015)Robust learning for repeated stochastic games via meta-gamingProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832581.2832725(3416-3422)Online publication date: 25-Jul-2015
      • (2015)Endgame Solving in Large Imperfect-Information GamesProceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems10.5555/2772879.2772888(37-45)Online publication date: 4-May-2015
      • (2015)Hierarchical Abstraction, Distributed Equilibrium Computation, and Post-Processing, with Application to a Champion No-Limit Texas Hold'em AgentProceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems10.5555/2772879.2772885(7-15)Online publication date: 4-May-2015
      • (2013)Computing pure Bayesian-Nash equilibria in games with finite actions and continuous typesArtificial Intelligence10.1016/j.artint.2012.09.007195(106-139)Online publication date: 1-Feb-2013
      • (2012)Lossy stochastic game abstraction with boundsProceedings of the 13th ACM Conference on Electronic Commerce10.1145/2229012.2229079(880-897)Online publication date: 4-Jun-2012

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media