Lanctot, 2013 - Google Patents

Monte Carlo sampling and regret minimization for equilibrium computation and decision-making in large extensive form games

Document ID: 7920299906710639627
Author: Lanctot M
Publication year: 2013

Snippet

In this thesis, we investigate the problem of decision-making in large two-player zero-sum games using Monte Carlo sampling and regret minimization methods. We demonstrate four major contributions. The first is Monte Carlo Counterfactual Regret Minimization (MC-CFR) …

Continue reading at era.library.ualberta.ca (PDF) (other versions)

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G06N3/02—Computer systems based on biological models using neural network models
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices

Similar Documents

Publication	Publication Date	Title
Holcomb et al.	2018	Overview on DeepMind and its AlphaGo Zero AI
Bošanský et al.	2016	Algorithms for computing strategies in two-player simultaneous move games
Mańdziuk	2010	Knowledge-free and learning-based methods in intelligent game playing
Perick et al.	2012	Comparison of different selection strategies in Monte-Carlo tree search for the game of Tron
Lanctot et al.	2014	Monte Carlo tree search in simultaneous move games with applications to Goofspiel
Brown et al.	2016	Strategy-based warm starting for regret minimization in games
Whitehouse	2014	Monte Carlo tree search for games with hidden information and uncertainty
Ponsen et al.	2009	An evolutionary game-theoretic analysis of poker strategies
Johanson	2016	Robust strategies and counter-strategies: from superhuman to optimal play
Schmid	2021	Search in imperfect information games
Mańdziuk	2007	Computational intelligence in mind games
Lanctot et al.	2014	Search in imperfect information games using online Monte Carlo counterfactual regret minimization
Dobre et al.	2015	Online learning and mining human play in complex games
Bitan et al.	2017	Combining prediction of human decisions with ISMCTS in imperfect information games
Bailis et al.	2014	Learning to play Monopoly: A reinforcement learning approach
Wang et al.	2017	Belief-state Monte Carlo tree search for Phantom Go
Lin et al.	2018	Multi-agent inverse reinforcement learning for general-sum stochastic games
Kira et al.	2019	A dynamic programming algorithm for optimizing baseball strategies
Cao et al.	2019	UCT-ADP Progressive Bias Algorithm for Solving Gomoku
Li et al.	2023	D2CFR: Minimize counterfactual regret with deep dueling neural network
Zhang et al.	2024	A Survey on Self-play Methods in Reinforcement Learning
Levinson et al.	1992	Adaptive-predictive game-playing programs
Liu et al.	2023	Soft-actor-attention-critic based on unknown agent action prediction for multi-agent collaborative confrontation
Schmid	2013	Game theory and poker