Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning

Autor:	Li, Zun, Lanctot, Marc, McKee, Kevin R., Marris, Luke, Gemp, Ian, Hennes, Daniel, Muller, Paul, Larson, Kate, Bachrach, Yoram, Wellman, Michael P.
Rok vydání:	2023
Předmět:	Computer Science - Artificial Intelligence Computer Science - Computer Science and Game Theory Computer Science - Machine Learning Computer Science - Multiagent Systems
Druh dokumentu:	Working Paper
Popis:	Multiagent reinforcement learning (MARL) has benefited significantly from population-based and game-theoretic training regimes. One approach, Policy-Space Response Oracles (PSRO), employs standard reinforcement learning to compute response policies via approximate best responses and combines them via meta-strategy selection. We augment PSRO by adding a novel search procedure with generative sampling of world states, and introduce two new meta-strategy solvers based on the Nash bargaining solution. We evaluate PSRO's ability to compute approximate Nash equilibrium, and its performance in two negotiation games: Colored Trails, and Deal or No Deal. We conduct behavioral studies where human participants negotiate with our agents ($N = 346$). We find that search with generative modeling finds stronger policies during both training time and test time, enables online Bayesian co-player prediction, and can produce agents that achieve comparable social welfare negotiating with humans as humans trading among themselves.
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/2302.00797 Zobrazit plný text záznamu View this record from Arxiv