Výsledky vyhledávání - "Perolat, Julien"

Report

Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning

Autor: Lanctot, Marc, Schultz, John, Burch, Neil, Smith, Max Olan, Hennes, Daniel, Anthony, Thomas, Perolat, Julien

Progress in fields of machine learning and adversarial planning has benefited significantly from benchmark domains, from checkers and the classic UCI data sets to Go and Diplomacy. In sequential decision-making, agent evaluation has largely been rest

Externí odkaz: http://arxiv.org/abs/2303.03196

Zobrazit plný text záznamu

Report

Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

The Game Theory & Multi-Agent team at DeepMind studies several aspects of multi-agent learning ranging from computing approximations to fundamental concepts in game theory to simulating social dilemmas in rich spatial environments and training 3-d hu

Externí odkaz: http://arxiv.org/abs/2209.10958

Zobrazit plný text záznamu

Report

Autor: Muller, Paul, Elie, Romuald, Rowland, Mark, Lauriere, Mathieu, Perolat, Julien, Perrin, Sarah, Geist, Matthieu, Piliouras, Georgios, Pietquin, Olivier, Tuyls, Karl

The designs of many large-scale systems today, from traffic routing environments to smart grids, rely on game-theoretic equilibrium concepts. However, as the size of an $N$-player game typically grows exponentially with $N$, standard game theoretic a

Externí odkaz: http://arxiv.org/abs/2208.10138

Zobrazit plný text záznamu

Report

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

We introduce DeepNash, an autonomous agent capable of learning to play the imperfect information game Stratego from scratch, up to a human expert level. Stratego is one of the few iconic board games that Artificial Intelligence (AI) has not yet maste

Externí odkaz: http://arxiv.org/abs/2206.15378

Zobrazit plný text záznamu

Plný text ve formátu HTML

Report

Learning in Mean Field Games: A Survey

Autor: Laurière, Mathieu, Perrin, Sarah, Pérolat, Julien, Girgin, Sertan, Muller, Paul, Élie, Romuald, Geist, Matthieu, Pietquin, Olivier

Non-cooperative and cooperative games with a very large number of players have many applications but remain generally intractable when the number of players increases. Introduced by Lasry and Lions, and Huang, Caines and Malham\'e, Mean Field Games (

Externí odkaz: http://arxiv.org/abs/2205.12944

Zobrazit plný text záznamu

Report

Scalable Deep Reinforcement Learning Algorithms for Mean Field Games

Autor: Laurière, Mathieu, Perrin, Sarah, Girgin, Sertan, Muller, Paul, Jain, Ayush, Cabannes, Theophile, Piliouras, Georgios, Pérolat, Julien, Élie, Romuald, Pietquin, Olivier, Geist, Matthieu

Mean Field Games (MFGs) have been introduced to efficiently approximate games with very large populations of strategic agents. Recently, the question of learning equilibria in MFGs has gained momentum, particularly using model-free reinforcement lear

Externí odkaz: http://arxiv.org/abs/2203.11973

Zobrazit plný text záznamu

Report

Learning Equilibria in Mean-Field Games: Introducing Mean-Field PSRO

Autor: Muller, Paul, Rowland, Mark, Elie, Romuald, Piliouras, Georgios, Perolat, Julien, Lauriere, Mathieu, Marinier, Raphael, Pietquin, Olivier, Tuyls, Karl

Recent advances in multiagent learning have seen the introduction ofa family of algorithms that revolve around the population-based trainingmethod PSRO, showing convergence to Nash, correlated and coarse corre-lated equilibria. Notably, when the numb

Externí odkaz: http://arxiv.org/abs/2111.08350

Zobrazit plný text záznamu

Report

Solving N-player dynamic routing games with congestion: a mean field approach

Autor: Cabannes, Theophile, Lauriere, Mathieu, Perolat, Julien, Marinier, Raphael, Girgin, Sertan, Perrin, Sarah, Pietquin, Olivier, Bayen, Alexandre M., Goubault, Eric, Elie, Romuald

The recent emergence of navigational tools has changed traffic patterns and has now enabled new types of congestion-aware routing control like dynamic road pricing. Using the fundamental diagram of traffic flows - applied in macroscopic and mesoscopi

Externí odkaz: http://arxiv.org/abs/2110.11943

Zobrazit plný text záznamu

Report

Shaking the foundations: delusions in sequence models for interaction and control

Autor: Ortega, Pedro A., Kunesch, Markus, Delétang, Grégoire, Genewein, Tim, Grau-Moya, Jordi, Veness, Joel, Buchli, Jonas, Degrave, Jonas, Piot, Bilal, Perolat, Julien, Everitt, Tom, Tallec, Corentin, Parisotto, Emilio, Erez, Tom, Chen, Yutian, Reed, Scott, Hutter, Marcus, de Freitas, Nando, Legg, Shane

The recent phenomenal success of language models has reinvigorated machine learning research, and large sequence models such as transformers are being applied to a variety of domains. One important problem class that has remained relatively elusive h

Externí odkaz: http://arxiv.org/abs/2110.10819

Zobrazit plný text záznamu

Report

Generalization in Mean Field Games by Learning Master Policies

Autor: Perrin, Sarah, Laurière, Mathieu, Pérolat, Julien, Élie, Romuald, Geist, Matthieu, Pietquin, Olivier

Mean Field Games (MFGs) can potentially scale multi-agent systems to extremely large populations of agents. Yet, most of the literature assumes a single initial distribution for the agents, which limits the practical applications of MFGs. Machine Lea

Externí odkaz: http://arxiv.org/abs/2109.09717

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání