research-article

Strategy purification and thresholding: effective non-equilibrium approaches for playing large games

Authors:

Sam Ganzfried,

Tuomas Sandholm,

Kevin WaughAuthors Info & Claims

AAMAS '12: Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 2

Pages 871 - 878

Published: 04 June 2012 Publication History

Get Access

Abstract

There has been significant recent interest in computing effective strategies for playing large imperfect-information games. Much prior work involves computing an approximate equilibrium strategy in a smaller abstract game, then playing this strategy in the full game (with the hope that it also well approximates an equilibrium in the full game). In this paper, we present a family of modifications to this approach that work by constructing non-equilibrium strategies in the abstract game, which are then played in the full game. Our new procedures, called purification and thresholding, modify the action probabilities of an abstract equilibrium by preferring the higher-probability actions. Using a variety of domains, we show that these approaches lead to significantly stronger play than the standard equilibrium approach. As one example, our program that uses purification came in first place in the two-player no-limit Texas Hold'em total bankroll division of the 2010 Annual Computer Poker Competition. Surprisingly, we also show that purification significantly improves performance (against the full equilibrium strategy) in random 4 x 4 matrix games using random 3 x 3 abstractions. We present several additional results (both theoretical and empirical). Overall, one can view these approaches as ways of achieving robustness against overfitting one's strategy to one's lossy abstraction. Perhaps surprisingly, the performance gains do not necessarily come at the expense of worst-case exploitability.

References

[1]

D. Billings, N. Burch, A. Davidson, R. Holte, J. Schaeffer, T. Schauenberg, and D. Szafron. Approximating game-theoretic optimal strategies for full-scale poker. In IJCAI, 2003.

Digital Library

Google Scholar

[2]

A. Gilpin and T. Sandholm. A competitive Texas Hold'em poker player via automated abstraction and real-time equilibrium computation. In AAAI, 2006.

Digital Library

Google Scholar

[3]

A. Gilpin and T. Sandholm. Lossless abstraction of imperfect information games. Journal of the ACM, 54(5), 2007.

Digital Library

Google Scholar

[4]

A. Gilpin and T. Sandholm. Expectation-based versus potential-aware automated abstraction in imperfect information games: An experimental comparison using poker. In AAAI, 2008. Short paper.

Digital Library

Google Scholar

[5]

A. Gilpin, T. Sandholm, and T. B. Sørensen. A heads-up no-limit Texas Hold'em poker player: Discretized betting models and automatically generated equilibrium-finding programs. In AAMAS, 2008.

Digital Library

Google Scholar

[6]

S. Hoda, A. Gilpin, J. Peña, and T. Sandholm. Smoothing techniques for computing Nash equilibria of sequential games. Mathematics of Operations Research, 35(2), 2010.

Digital Library

Google Scholar

[7]

M. Johanson, K. Waugh, M. Bowling, and M. Zinkevich. Accelerating best response calculation in large extensive games. In IJCAI, 2011.

Digital Library

Google Scholar

[8]

D. Koller, N. Megiddo, and B. von Stengel. Fast algorithms for finding randomized strategies in game trees. In STOC, 1994.

Digital Library

Google Scholar

[9]

R. Porter, E. Nudelman, and Y. Shoham. Simple search methods for finding a Nash equilibrium. Games and Economic Behavior, 63(2), 2008.

Google Scholar

[10]

D. Schnizlein, M. Bowling, and D. Szafron. Probabilistic state translation in extensive games with large action sets. In IJCAI, 2009.

Digital Library

Google Scholar

[11]

J. Shi and M. Littman. Abstraction methods for game theoretic poker. In CG '00: Revised Papers from the International Conference on Computers and Games. Springer-Verlag, 2002.

Digital Library

Google Scholar

[12]

K. Waugh, D. Schnizlein, M. Bowling, and D. Szafron. Abstraction pathologies in extensive games. In AAMAS, 2009.

Digital Library

Google Scholar

[13]

M. Zinkevich, M. Bowling, M. Johanson, and C. Piccione. Regret minimization in games with incomplete information. In NIPS, 2007.

Digital Library

Google Scholar

Cited By

View all

Ganzfried S(2016)Reflections on the first man vs. machine no-limit Texas hold 'em competitionACM SIGecom Exchanges10.1145/2904104.290410514:2(2-15)Online publication date: 16-Mar-2016
https://dl.acm.org/doi/10.1145/2904104.2904105
Crandall J(2015)Robust learning for repeated stochastic games via meta-gamingProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832581.2832725(3416-3422)Online publication date: 25-Jul-2015
https://dl.acm.org/doi/10.5555/2832581.2832725
Ganzfried SSandholm TWeiss GYolum PBordini RElkind E(2015)Endgame Solving in Large Imperfect-Information GamesProceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems10.5555/2772879.2772888(37-45)Online publication date: 4-May-2015
https://dl.acm.org/doi/10.5555/2772879.2772888
Show More Cited By

Index Terms

Strategy purification and thresholding: effective non-equilibrium approaches for playing large games
1. Applied computing
  1. Law, social and behavioral sciences
2. Computing methodologies
  1. Artificial intelligence
    1. Distributed artificial intelligence

Recommendations

A near-optimal strategy for a heads-up no-limit Texas Hold'em poker tournament
AAMAS '07: Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems

We analyze a heads-up no-limit Texas Hold'em poker tournament with a fixed small blind of 300 chips, a fixed big blind of 600 chips and a total amount of 8000 chips on the table (until recently, these parameters defined the heads-up endgame of sit-n-go ...
Lipschitz Games

The Lipschitz constant of a finite normal-form game is the maximal change in some player's payoff when a single opponent changes his strategy. We prove that games with small Lipschitz constant admit pure ε-equilibria, and pinpoint the maximal Lipschitz ...
Using counterfactual regret minimization to create competitive multiplayer poker agents
AAMAS '10: Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1

Games are used to evaluate and advance Multiagent and Artificial Intelligence techniques. Most of these games are deterministic with perfect information (e.g. Chess and Checkers). A deterministic game has no chance element and in a perfect information ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

AAMAS '12: Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 2

June 2012

601 pages

ISBN:0981738125

In-Cooperation

SIGAI: ACM Special Interest Group on Artificial Intelligence

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

Richland, SC

Publication History

Published: 04 June 2012

Check for updates

Author Tags

Qualifiers

Research-article

Conference

AAMAS 12

Sponsor:

The International Foundation for Autonomous Agents and Multiagent Systems

AAMAS 12: International Conference on Autonomous Agents and Multiagent Systems

June 4 - 8, 2012

Valencia, Spain

Acceptance Rates

Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

6
Total Citations
View Citations
89
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 03 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Ganzfried S(2016)Reflections on the first man vs. machine no-limit Texas hold 'em competitionACM SIGecom Exchanges10.1145/2904104.290410514:2(2-15)Online publication date: 16-Mar-2016
https://dl.acm.org/doi/10.1145/2904104.2904105
Crandall J(2015)Robust learning for repeated stochastic games via meta-gamingProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832581.2832725(3416-3422)Online publication date: 25-Jul-2015
https://dl.acm.org/doi/10.5555/2832581.2832725
Ganzfried SSandholm TWeiss GYolum PBordini RElkind E(2015)Endgame Solving in Large Imperfect-Information GamesProceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems10.5555/2772879.2772888(37-45)Online publication date: 4-May-2015
https://dl.acm.org/doi/10.5555/2772879.2772888
Brown NGanzfried SSandholm TWeiss GYolum PBordini RElkind E(2015)Hierarchical Abstraction, Distributed Equilibrium Computation, and Post-Processing, with Application to a Champion No-Limit Texas Hold'em AgentProceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems10.5555/2772879.2772885(7-15)Online publication date: 4-May-2015
https://dl.acm.org/doi/10.5555/2772879.2772885
Rabinovich ZNaroditskiy VGerding EJennings N(2013)Computing pure Bayesian-Nash equilibria in games with finite actions and continuous typesArtificial Intelligence10.1016/j.artint.2012.09.007195(106-139)Online publication date: 1-Feb-2013
https://dl.acm.org/doi/10.1016/j.artint.2012.09.007
Sandholm TSingh SFaltings BLeyton-Brown KIpeirotis P(2012)Lossy stochastic game abstraction with boundsProceedings of the 13th ACM Conference on Electronic Commerce10.1145/2229012.2229079(880-897)Online publication date: 4-Jun-2012
https://dl.acm.org/doi/10.1145/2229012.2229079

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

A near-optimal strategy for a heads-up no-limit Texas Hold'em poker tournament

Lipschitz Games

Using counterfactual regret minimization to create competitive multiplayer poker agents

Comments

Information

Published In

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations