Showing 1 - 10 of 545 for the search: '"MULLER, Paul"'
Author:
Li, Zun, Lanctot, Marc, McKee, Kevin R., Marris, Luke, Gemp, Ian, Hennes, Daniel, Muller, Paul, Larson, Kate, Bachrach, Yoram, Wellman, Michael P.
Multiagent reinforcement learning (MARL) has benefited significantly from population-based and game-theoretic training regimes. One approach, Policy-Space Response Oracles (PSRO), employs standard reinforcement learning to compute response policies v
External link:
http://arxiv.org/abs/2302.00797
Author:
Gemp, Ian, Anthony, Thomas, Bachrach, Yoram, Bhoopchand, Avishkar, Bullard, Kalesha, Connor, Jerome, Dasagi, Vibhavari, De Vylder, Bart, Duenez-Guzman, Edgar, Elie, Romuald, Everett, Richard, Hennes, Daniel, Hughes, Edward, Khan, Mina, Lanctot, Marc, Larson, Kate, Lever, Guy, Liu, Siqi, Marris, Luke, McKee, Kevin R., Muller, Paul, Perolat, Julien, Strub, Florian, Tacchetti, Andrea, Tarassov, Eugene, Wang, Zhe, Tuyls, Karl
The Game Theory & Multi-Agent team at DeepMind studies several aspects of multi-agent learning ranging from computing approximations to fundamental concepts in game theory to simulating social dilemmas in rich spatial environments and training 3-d hu
External link:
http://arxiv.org/abs/2209.10958
Author:
Muller, Paul, Elie, Romuald, Rowland, Mark, Lauriere, Mathieu, Perolat, Julien, Perrin, Sarah, Geist, Matthieu, Piliouras, Georgios, Pietquin, Olivier, Tuyls, Karl
The designs of many large-scale systems today, from traffic routing environments to smart grids, rely on game-theoretic equilibrium concepts. However, as the size of an $N$-player game typically grows exponentially with $N$, standard game theoretic a
External link:
http://arxiv.org/abs/2208.10138
Author:
Perolat, Julien, de Vylder, Bart, Hennes, Daniel, Tarassov, Eugene, Strub, Florian, de Boer, Vincent, Muller, Paul, Connor, Jerome T., Burch, Neil, Anthony, Thomas, McAleer, Stephen, Elie, Romuald, Cen, Sarah H., Wang, Zhe, Gruslys, Audrunas, Malysheva, Aleksandra, Khan, Mina, Ozair, Sherjil, Timbers, Finbarr, Pohlen, Toby, Eccles, Tom, Rowland, Mark, Lanctot, Marc, Lespiau, Jean-Baptiste, Piot, Bilal, Omidshafiei, Shayegan, Lockhart, Edward, Sifre, Laurent, Beauguerlange, Nathalie, Munos, Remi, Silver, David, Singh, Satinder, Hassabis, Demis, Tuyls, Karl
We introduce DeepNash, an autonomous agent capable of learning to play the imperfect information game Stratego from scratch, up to a human expert level. Stratego is one of the few iconic board games that Artificial Intelligence (AI) has not yet maste
External link:
http://arxiv.org/abs/2206.15378
Author:
Laurière, Mathieu, Perrin, Sarah, Pérolat, Julien, Girgin, Sertan, Muller, Paul, Élie, Romuald, Geist, Matthieu, Pietquin, Olivier
Non-cooperative and cooperative games with a very large number of players have many applications but remain generally intractable when the number of players increases. Introduced by Lasry and Lions, and Huang, Caines and Malhamé, Mean Field Games (
External link:
http://arxiv.org/abs/2205.12944
Author:
Laurière, Mathieu, Perrin, Sarah, Girgin, Sertan, Muller, Paul, Jain, Ayush, Cabannes, Theophile, Piliouras, Georgios, Pérolat, Julien, Élie, Romuald, Pietquin, Olivier, Geist, Matthieu
Mean Field Games (MFGs) have been introduced to efficiently approximate games with very large populations of strategic agents. Recently, the question of learning equilibria in MFGs has gained momentum, particularly using model-free reinforcement lear
External link:
http://arxiv.org/abs/2203.11973
Author:
Muller, Paul, Rowland, Mark, Elie, Romuald, Piliouras, Georgios, Perolat, Julien, Lauriere, Mathieu, Marinier, Raphael, Pietquin, Olivier, Tuyls, Karl
Recent advances in multiagent learning have seen the introduction of a family of algorithms that revolve around the population-based training method PSRO, showing convergence to Nash, correlated and coarse correlated equilibria. Notably, when the numb
External link:
http://arxiv.org/abs/2111.08350
Two-player, constant-sum games are well studied in the literature, but there has been limited progress outside of this setting. We propose Joint Policy-Space Response Oracles (JPSRO), an algorithm for training agents in n-player, general-sum extensiv
External link:
http://arxiv.org/abs/2106.09435
Author:
Omidshafiei, Shayegan, Hennes, Daniel, Garnelo, Marta, Tarassov, Eugene, Wang, Zhe, Elie, Romuald, Connor, Jerome T., Muller, Paul, Graham, Ian, Spearman, William, Tuyls, Karl
In multiagent environments, several decision-making individuals interact while adhering to the dynamics constraints imposed by the environment. These interactions, combined with the potential stochasticity of the agents' decision-making processes, ma
External link:
http://arxiv.org/abs/2106.04219
Author:
Liu, Siqi, Lever, Guy, Wang, Zhe, Merel, Josh, Eslami, S. M. Ali, Hennes, Daniel, Czarnecki, Wojciech M., Tassa, Yuval, Omidshafiei, Shayegan, Abdolmaleki, Abbas, Siegel, Noah Y., Hasenclever, Leonard, Marris, Luke, Tunyasuvunakool, Saran, Song, H. Francis, Wulfmeier, Markus, Muller, Paul, Haarnoja, Tuomas, Tracey, Brendan D., Tuyls, Karl, Graepel, Thore, Heess, Nicolas
Intelligent behaviour in the physical world exhibits structure at multiple spatial and temporal scales. Although movements are ultimately executed at the level of instantaneous muscle tensions or joint torques, they must be selected to serve goals de
External link:
http://arxiv.org/abs/2105.12196