CadiaPlayer: Search-Control Techniques

Hilmar Finnsson¹ &
Yngvi Björnsson¹

258 Accesses
8 Citations
Explore all metrics

Abstract

Effective search control is one of the key components of any successful simulation-based game-playing program. In General Game Playing (GGP), learning of useful search-control knowledge is a particularly challenging task because it must be done in real-time during online play. In here we describe the search-control techniques used in the 2010 version of the GGP agent CadiaPlayer, and show how they have evolved over the years to become increasingly effective and robust across a wide range of games. In particular, we present a new combined search-control scheme (RAVE/MAST/FAST) for biasing action selection. The scheme proves quite effective on a wide range of games including chess-like games, which have up until now proved quite challenging for simulation-based GGP agents.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Artificial Intelligence

Notes

Skirmish is a chess-like game. We used the variation played in the 2007 GGP finals where each player has two knights, two bishops, two rooks, and four pawns (can only move by capturing). The objective is to capture as many of the opponent’s pieces as possible. Breakthrough is played on a chess board where the players, armed with two ranks of pawns, try to be the first to break through to reach the opponent’s back rank. The pawns can move forward and diagonally, but can only capture diagonally. Chinook is a variant of Breakthrough played with checkers pawns and the added twist that two independent games are played simultaneously, one on the white squares and another on the black ones. 3D Tic-Tac-Toe and Nine Board Tic-Tac-Toe are variants of Tic-Tac-Toe; the former is played on a 4×4×4 cube, and in the latter 9 Tic-Tac-Toe boards are placed in a 3×3 grid formation, and on each turn a player can play only on the board having the same coordinate as the last cell marked by the opponent (e.g., if the opponent marked a center cell in one of the boards, the player must on his turn play in the Tic-Tac-Toe board in the center). TTCC4 is a hybrid of several games where each player has a chess pawn, a chess knight, and a checkers king—these pieces respawn on their start square if captured. Instead of moving a piece a player can choose to drop a disk onto the board like in Connect 4 (captured disks do not respawn). The goal is to form a 3-in-a-row formation with your pieces anywhere on the 3×3 center squares of the table. Full game descriptions can be found on the Dresden GGP server (http://euklid.inf.tu-dresden.de:8180/ggpserver).

References

Banerjee B, Stone P (2007) General game learning using knowledge transfer. In: Veloso MM (ed) 20th IJCAI, pp 672–677
Google Scholar
Björnsson Y, Finnsson H (2009) CADIAPlayer: a simulation-based general game player. IEEE Trans Comput Intell AI in Games 1(1):4–15
Article Google Scholar
Chaslot G, Winands M, van den Herik JH, Uiterwijk J, Bouzy B (2007) Progressive strategies for Monte-Carlo tree search. In: 10th JCIS, heuristic search and computer game playing session
Google Scholar
Clune J (2007) Heuristic evaluation functions for general game playing. In: Holte RC, Howe A (eds) 22nd AAAI. AAAI Press, Menlo Park, pp 1134–1139
Google Scholar
Coulom R (2006) Efficient selectivity and backup operators in Monte-Carlo tree search. In: van den Herik HJ, Ciancarini P, Donkers HHLM (eds) CG2006. Springer, Berlin, pp 72–83
Google Scholar
Cox E, Schkufza E, Madsen Ryan MG (2009) Factoring general games using propositional automata. In: GIGA’09 the IJCAI workshop on general game playing
Google Scholar
Enzenberger M, Müller M (2009) Fuego—an open-source framework for board games and Go engine based on Monte-Carlo tree search. Tech Rep 09-08, Dept of Computing Science, University of Alberta
Finnsson H, Björnsson Y (2008) Simulation-based approach to general game playing. In: Fox D, Gomes CP (eds) 23rd AAAI. AAAI Press, Menlo Park, pp 259–264
Google Scholar
Finnsson H, Björnsson Y (2010) Learning simulation-control in general game-playing agents. In: Fox M, Poole D (eds) 24th AAAI. AAAI Press, Menlo Park, pp 954–959
Google Scholar
Gelly S, Silver D (2007) Combining online and offline knowledge in UCT. In: Ghahramani Z (ed) 24th ICML, vol 227. ACM, New York, pp 273–280
Chapter Google Scholar
Gelly S, Wang Y, Munos R, Teytaud O (2006) Modification of UCT with patterns in Monte-Carlo Go. Technical Report 6062, INRIA
Genesereth MR, Love N, Pell B (2005) General game playing: overview of the AAAI competition. AI Mag 26(2):62–72
Google Scholar
Günther M, Schiffel S, Thielscher M (2009) Factoring general games. In: GIGA’09 the IJCAI workshop on general game playing
Google Scholar
Kirci M, Schaeffer J, Sturtevant N (2009) Feature learning using state differences. In: GIGA’09 the IJCAI workshop on general game playing
Google Scholar
Kocsis L, Szepesvári C (2006) Bandit based Monte-Carlo planning. In: ECML, pp 282–293
Google Scholar
Kuhlmann G, Dresner K, Stone P (2006) Automatic heuristic construction in a complete general game player. In: 21st AAAI, pp 1457–1462
Google Scholar
Kuhlmann G, Stone P (2007) Graph-based domain mapping for transfer learning in general games. In: 18th ECML
Google Scholar
Lorentz RJ (2008) Amazons discover Monte-Carlo. In: van den Herik HJ, Xu X, Ma Z, Winands MHM (eds) CG2008. Lecture notes in computer science, vol 5131. Springer, Berlin, pp 13–24
Google Scholar
Love N, Hinrichs T, Genesereth M (2006) General game playing: game description language specification. Technical Report, Stanford University, 4 April 2006
Pell B (1996) A strategic metagame player for general chess-like games. Comput Intell 12:177–198
Article Google Scholar
Schiffel S (2010) Symmetry detection in general game playing. In: Fox M, Poole D (eds) 24th AAAI, pp 980–985. AAAI Press, Menlo Park
Google Scholar
Schiffel S, Thielscher M (2007) Automatic construction of a heuristic search function for general game playing. In: Veloso MM (ed) 7th IJCAI workshop on nonmontonic reasoning, action and change (NRAC07)
Google Scholar
Schiffel S, Thielscher M (2007) Fluxplayer: a successful general game player. In: Holte RC, Howe A (eds) 22nd AAAI, AAAI Press, Menlo Park, pp 1191–1196
Google Scholar
Schiffel S, Thielscher M (2009) Automated theorem proving for general game playing. In: Boutilier C (ed) 21st IJCAI. Morgan Kaufmann, San Francisco, pp 911–916
Google Scholar
Sharma S, Kobti Z, Goodwin S (2008) Knowledge generation for improving simulations in UCT for general game playing. In: AI 2008: advances in artificial intelligence. Springer, Berlin, pp 49–55
Chapter Google Scholar
Sutton RS (1988) Learning to predict by the methods of temporal differences. Mach Learn 3:9–44
Google Scholar
Thielscher M (2010) A general game description language for incomplete information games. In: Fox M, Poole D (eds) 24th AAAI. AAAI Press, Menlo Park, pp 994–999
Google Scholar
Thielscher M, Voigt S (2010) A temporal proof system for general game playing. In: Fox M, Poole D (eds) 24th AAAI. AAAI Press, Menlo Park, pp 1000–1005
Google Scholar
Waugh K (2009) Faster state manipulation in general games using generated code. In: GIGA’09 the IJCAI workshop on general game playing
Google Scholar

Download references

Author information

Authors and Affiliations

Reykjavík University, Menntavegur 1, 101, Reykjavík, Iceland
Hilmar Finnsson & Yngvi Björnsson

Authors

Hilmar Finnsson
View author publications
You can also search for this author in PubMed Google Scholar
Yngvi Björnsson
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yngvi Björnsson.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Finnsson, H., Björnsson, Y. CadiaPlayer: Search-Control Techniques. Künstl Intell 25, 9–16 (2011). https://doi.org/10.1007/s13218-010-0080-9

Download citation

Received: 31 July 2010
Accepted: 11 September 2010
Published: 11 January 2011
Issue Date: March 2011
DOI: https://doi.org/10.1007/s13218-010-0080-9

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

General Game Playing

General Game Playing

Artificial Intelligence for Games

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

CadiaPlayer: Search-Control Techniques

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

General Game Playing

General Game Playing

Artificial Intelligence for Games

Explore related subjects

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation