On the Scalability of Parallel UCT

Richard B. Segal¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6515))

Included in the following conference series:

International Conference on Computers and Games

1727 Accesses
10 Citations

Abstract

The parallelization of MCTS across multiple-machines has proven surprisingly difficult. The limitations of existing algorithms were evident in the 2009 Computer Olympiad where Zen using a single four-core machine defeated both Fuego with ten eight-core machines, and Mogo with twenty thirty-two core machines. This paper investigates the limits of parallel MCTS in order to understand why distributed parallelism has proven so difficult and to pave the way towards future distributed algorithms with better scaling. We first analyze the single-threaded scaling of Fuego and find that there is an upper bound on the play-quality improvements which can come from additional search. We then analyze the scaling of an idealized N-core shared memory machine to determine the maximum amount of parallelism supported by MCTS. We show that parallel speedup depends critically on how much time is given to each player. We use this relationship to predict parallel scaling for time scales beyond what can be empirically evaluated due to the immense computation required. Our results show that MCTS can scale nearly perfectly to at least 64 threads when combined with virtual loss, but without virtual loss scaling is limited to just eight threads. We also find that for competition time controls scaling to thousands of threads is impossible not necessarily due to MCTS not scaling, but because high levels of parallelism can start to bump up against the upper performance bound of Fuego itself.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Is Today’s Public Cloud Suited to Deploy Hardcore Realtime Services?

Parallel Monte Carlo Tree Search in General Video Game Playing

Multi-Level Parallelization of the Fragment Molecular Orbital Method in GAMESS

References

Gelly, S., Hoock, J.B., Rimmel, A., Teytaud, O., Kalemkarian, Y.: The parallelization of monte-carlo planning - parallelization of mc-planning. In: ICINCO-ICSO, pp. 244–249 (2008)
Google Scholar
Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
Chapter Google Scholar
Gelly, S., Silver, D.: Combining online and offline learning in UCT. In: 17th International Conference on Machine Learning, pp. 273–280 (2007)
Google Scholar
Gelly, S., Wang, Y., Munos, R., Teytaud, O.: Modification of UCT with patterns in Monte-Carlo Go. Technical Report 6062, INRIA, France (2006)
Google Scholar
Chaslot, G.M.J.-B., Winands, M.H.M., van den Herik, H.J.: Parallel monte-carlo tree search. In: van den Herik, H.J., Xu, X., Ma, Z., Winands, M.H.M. (eds.) CG 2008. LNCS, vol. 5131, pp. 60–71. Springer, Heidelberg (2008)
Chapter Google Scholar
Enzenberger, M., Müller, M.: A lock-free multithreaded monte-carlo tree search algorithm. In: Advances in Computer Games 12 (2009)
Google Scholar
Auer, P., Cesa-Binachi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning 47, 235–256 (2002)
Article MATH Google Scholar
Dailey, D.: 9x9 scalability study (2008), http://cgos.boardspace.net/study/index.html
Cazenave, T., Jouandeau, N.: A parallel monte-carlo tree search algorithm. In: van den Herik, H.J., Xu, X., Ma, Z., Winands, M.H.M. (eds.) CG 2008. LNCS, vol. 5131, pp. 72–80. Springer, Heidelberg (2008)
Chapter Google Scholar
Teytaud, O.: Parallel algorithms. Posting to the Computer Go mailing list (2008), http://computer-go.org/pipermail/computer-go/2008-May/015074.html

Download references

Author information

Authors and Affiliations

IBM Research, Yorktown Heights, NY, USA
Richard B. Segal

Authors

Richard B. Segal
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Tilburg Center for Cognition and Communication (TiCC), Tilburg University, P.O. Box 90153, 5000LE, Tilburg, The Netherlands
H. Jaap van den Herik & Aske Plaat &
Japan Advanced Institute of Science and Technology, Research Unit for Computers and Games, 1-1, Asahidai, 923-1292, Nomi, Ishikawa, Japan
Hiroyuki Iida

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Segal, R.B. (2011). On the Scalability of Parallel UCT. In: van den Herik, H.J., Iida, H., Plaat, A. (eds) Computers and Games. CG 2010. Lecture Notes in Computer Science, vol 6515. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17928-0_4

Download citation

DOI: https://doi.org/10.1007/978-3-642-17928-0_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17927-3
Online ISBN: 978-3-642-17928-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

On the Scalability of Parallel UCT

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Is Today’s Public Cloud Suited to Deploy Hardcore Realtime Services?

Parallel Monte Carlo Tree Search in General Video Game Playing

Multi-Level Parallelization of the Fragment Molecular Orbital Method in GAMESS

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

On the Scalability of Parallel UCT

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Is Today’s Public Cloud Suited to Deploy Hardcore Realtime Services?

Parallel Monte Carlo Tree Search in General Video Game Playing

Multi-Level Parallelization of the Fragment Molecular Orbital Method in GAMESS

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation