Autor:	Kim, Juho
Rok vydání:	2024
Předmět:	Computer Science - Computer Science and Game Theory Computer Science - Machine Learning
Druh dokumentu:	Working Paper
Popis:	Counterfactual regret minimization is a family of algorithms of no-regret learning dynamics capable of solving large-scale imperfect information games. We propose implementing this algorithm as a series of dense and sparse matrix and vector operations, thereby making it highly parallelizable for a graphical processing unit, at a cost of higher memory usages. Our experiments show that our implementation performs up to about 352.5 times faster than OpenSpiel's Python implementation and up to about 22.2 times faster than OpenSpiel's C++ implementation and the speedup becomes more pronounced as the size of the game being solved grows.
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/2408.14778 Zobrazit plný text záznamu View this record from Arxiv