Abstract
The ancient oriental game of Go has long been considered a grand challenge for artificial intelligence. For decades, computer Go has defied the classical methods in game tree search that worked so successfully for chess and checkers. However, recent play in computer Go has been transformed by a new paradigm for tree search based on Monte-Carlo methods. Programs based on Monte-Carlo tree search now play at human-master levels and are beginning to challenge top professional players. In this paper, we describe the leading algorithms for Monte-Carlo tree search and explain how they have advanced the state of the art in computer Go.
References
25
Referenced
124
- Special issue on Monte Carlo techniques and computer Go. In C.-S. Lee, M. Müller, and O. Teytaud, eds , IEEE Trans. Comput. Intell. AI in Games , 2 ( 2010 ). Special issue on Monte Carlo techniques and computer Go. In C.-S. Lee, M. Müller, and O. Teytaud, eds, IEEE Trans. Comput. Intell. AI in Games, 2 (2010). / IEEE Trans. Comput. Intell. AI in Games / Carlo techniques and computer Go. In C.-S. Lee, M. Müller, and O. Teytaud, eds by Special (2010)
10.1109/34.44404
10.1023/A:1013689704352
10.5555/1950322.1950327
- Bouzy , B. , Chaslot , G. Bayesian generation and integration of k-nearest-neighbor patterns for 19 × 19 Go . In IEEE Symposium on Computational Intelligence and Games (CIG-05) ( 2005 ). Bouzy, B., Chaslot, G. Bayesian generation and integration of k-nearest-neighbor patterns for 19 × 19 Go. In IEEE Symposium on Computational Intelligence and Games (CIG-05) (2005). / IEEE Symposium on Computational Intelligence and Games (CIG-05) by Bouzy B. (2005)
- Bouzy , B. , Helmstetter , B. Monte-Carlo Go developments . In 10th International Conference on Advances in Computer Games (ACG-03) ( 2003 ), 159--174. Bouzy, B., Helmstetter, B. Monte-Carlo Go developments. In 10th International Conference on Advances in Computer Games (ACG-03) (2003), 159--174. / 10th International Conference on Advances in Computer Games (ACG-03) by Bouzy B. (2003)
- Brügmann , B. Monte-Carlo Go . Technical report , Max Planck Institute of Physics , 1993 . Brügmann, B. Monte-Carlo Go. Technical report, Max Planck Institute of Physics, 1993. / Technical report by Brügmann B. (1993)
- Bubeck , S. , Munos , R. , Stoltz , G. , Szepesvári , C. Online optimization in X-armed bandits. In Advances in Neural Information Processing Systems 22 (NIPS-22), D. Koller and D. Schuurmans and Y. Bengio and L. Bottou, eds . MIT Press , 2009 , 201--208. Bubeck, S., Munos, R., Stoltz, G., Szepesvári, C. Online optimization in X-armed bandits. In Advances in Neural Information Processing Systems 22 (NIPS-22), D. Koller and D. Schuurmans and Y. Bengio and L. Bottou, eds. MIT Press, 2009, 201--208. / MIT Press by Bubeck S. (2009)
- Cazenave , T. , Balbo , F. , Pinson , S. Monte-Carlo bus regulation . In 12th International IEEE Conference on Intelligent Transportation Systems ( 2009 ), 340--345. Cazenave, T., Balbo, F., Pinson, S. Monte-Carlo bus regulation. In 12th International IEEE Conference on Intelligent Transportation Systems (2009), 340--345. / 12th International IEEE Conference on Intelligent Transportation Systems by Cazenave T. (2009)
10.5555/1944566.1944583
- Coquelin , P.A. , Munos , R. Bandit algorithms for tree search . In 23rd Conference on Uncertainty in Artificial Intelligence (UAI-07) ( 2007 ), 67--74. Coquelin, P.A., Munos, R. Bandit algorithms for tree search. In 23rd Conference on Uncertainty in Artificial Intelligence (UAI-07) (2007), 67--74. / 23rd Conference on Uncertainty in Artificial Intelligence (UAI-07) by Coquelin P.A. (2007)
10.5555/1777826.1777833
- Coulom , R . Computing Elo ratings of move patterns in the game of Go . Int. Comput. Game. Assoc. J. 30 , 4 ( 2007 ), 198--208. Coulom, R. Computing Elo ratings of move patterns in the game of Go. Int. Comput. Game. Assoc. J. 30, 4 (2007), 198--208. / Int. Comput. Game. Assoc. J. / Computing Elo ratings of move patterns in the game of Go by Coulom R (2007)
- Finnsson , H. , Björnsson , Y. Simulation-based approach to general game playing . In 23rd AAAI Conference on Artificial Intelligence (AAAI-08) ( 2008 ), 259--264. Finnsson, H., Björnsson, Y. Simulation-based approach to general game playing. In 23rd AAAI Conference on Artificial Intelligence (AAAI-08) (2008), 259--264. / 23rd AAAI Conference on Artificial Intelligence (AAAI-08) by Finnsson H. (2008)
10.1016/j.artint.2011.03.007
- Gelly , S. , Wang , Y. , Munos , R. , Teytaud , O. Modification of UCT with Patterns in Monte-Carlo Go. Rapport de recherche INRIA RR-6062 , 2006 . Gelly, S., Wang, Y., Munos, R., Teytaud, O. Modification of UCT with Patterns in Monte-Carlo Go. Rapport de recherche INRIA RR-6062, 2006. / Modification of UCT with Patterns in Monte-Carlo Go. Rapport de recherche INRIA RR-6062 by Gelly S. (2006)
- Huang , S. , Coulom , R. , Lin , S. Monte-Carlo simulation balancing in practice . In 7th International Conference on Computers and, Games (CG-09) ( 2009 ), 119--126. Huang, S., Coulom, R., Lin, S. Monte-Carlo simulation balancing in practice. In 7th International Conference on Computers and, Games (CG-09) (2009), 119--126. / 7th International Conference on Computers and, Games (CG-09) by Huang S. (2009)
10.1007/11871842_29
10.1016/0196-8858(85)90002-8
10.5555/1661445.1661729
10.1090/S0002-9904-1952-09620-8
- Schaeffer , J . The games computers (and people) play . Adv. Comput. , 52 ( 2000 ), 190 -- 268 . Schaeffer, J. The games computers (and people) play. Adv. Comput., 52 (2000), 190--268. / Adv. Comput. / The games computers (and people) play by Schaeffer J (2000)
10.5555/1736406.1736447
10.1109/TSMC.1973.4309272
10.5555/1597538.1597631
Dates
Type | When |
---|---|
Created | 13 years, 6 months ago (Feb. 22, 2012, 1:42 p.m.) |
Deposited | 2 months, 1 week ago (June 18, 2025, 5:54 a.m.) |
Indexed | 1 month, 3 weeks ago (July 5, 2025, 12:46 a.m.) |
Issued | 13 years, 5 months ago (March 1, 2012) |
Published | 13 years, 5 months ago (March 1, 2012) |
Published Online | 13 years, 5 months ago (March 1, 2012) |
Published Print | 13 years, 5 months ago (March 1, 2012) |
Funders
2
Seventh Framework Programme
10.13039/501100004963
Region: Europe
gov (National government)
Labels
5
- EC Seventh Framework Programme
- European Commission Seventh Framework Programme
- EU Seventh Framework Programme
- European Union Seventh Framework Programme
- FP7
Awards
1
- 216886
Sixth Framework Programme
10.13039/501100004965
Region: Europe
gov (National government)
Labels
5
- EC Sixth Framework Programme
- European Commission Sixth Framework Programme
- EU Sixth Framework Programme
- European Union Sixth Framework Programme
- FP6
Awards
1
- IST 2002-506778
@article{Gelly_2012, title={The grand challenge of computer Go: Monte Carlo tree search and extensions}, volume={55}, ISSN={1557-7317}, url={http://dx.doi.org/10.1145/2093548.2093574}, DOI={10.1145/2093548.2093574}, number={3}, journal={Communications of the ACM}, publisher={Association for Computing Machinery (ACM)}, author={Gelly, Sylvain and Kocsis, Levente and Schoenauer, Marc and Sebag, Michèle and Silver, David and Szepesvári, Csaba and Teytaud, Olivier}, year={2012}, month=mar, pages={106–113} }