On-Line Parameter Tuning for Monte-Carlo Tree Search in General Game Playing
Autor: | Sironi, Chiara F., Winands, Mark H. M., Cazenave, Tristan, Winands, Mark H.M., Saffidine, Abdallah |
---|---|
Přispěvatelé: | DKE Scientific staff, RS: FSE DACS NSO |
Jazyk: | angličtina |
Rok vydání: | 2018 |
Předmět: |
Computer science
Computation Monte Carlo tree search Context (language use) 0102 computer and information sciences 02 engineering and technology computer.software_genre 01 natural sciences General game playing Tree (data structure) 010201 computation theory & mathematics Line (geometry) 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing computer Algorithm |
Zdroj: | Computer Games: 6th Workshop, CGW 2017, Held in Conjunction with the 26th International Conference on Artificial Intelligence, IJCAI 2017, Melbourne, VIC, Australia, August, 20, 2017, Revised Selected Papers, 75-95 STARTPAGE=75;ENDPAGE=95;TITLE=Computer Games Communications in Computer and Information Science ISBN: 9783319759302 CGW@IJCAI STARTPAGE=235;ENDPAGE=236;TITLE=30th Benelux Conference on Artificial Intelligence |
Popis: | Many enhancements have been proposed for Monte-Carlo Tree Search (MCTS). Some of them have been applied successfully in the context of General Game Playing (GGP). MCTS and its enhancements are usually controlled by multiple parameters that require extensive and time-consuming computation to be tuned in advance. Moreover, in GGP optimal parameter values may vary depending on the considered game. This paper proposes a method to automatically tune search-control parameters on-line for GGP. This method considers the tuning problem as a Combinatorial Multi-Armed Bandit (CMAB). Four strategies designed to deal with CMABs are evaluated for this particular problem. Experiments show that on-line tuning in GGP almost reaches the same performance as off-line tuning. It can be considered as a valid alternative for domains where off-line parameter tuning is costly or infeasible. |
Databáze: | OpenAIRE |
Externí odkaz: |