Zobrazeno 1 - 10
of 80
pro vyhledávání: '"Perolat, Julien"'
Autor:
Lanctot, Marc, Schultz, John, Burch, Neil, Smith, Max Olan, Hennes, Daniel, Anthony, Thomas, Perolat, Julien
Progress in fields of machine learning and adversarial planning has benefited significantly from benchmark domains, from checkers and the classic UCI data sets to Go and Diplomacy. In sequential decision-making, agent evaluation has largely been rest
Externí odkaz:
http://arxiv.org/abs/2303.03196
Autor:
Gemp, Ian, Anthony, Thomas, Bachrach, Yoram, Bhoopchand, Avishkar, Bullard, Kalesha, Connor, Jerome, Dasagi, Vibhavari, De Vylder, Bart, Duenez-Guzman, Edgar, Elie, Romuald, Everett, Richard, Hennes, Daniel, Hughes, Edward, Khan, Mina, Lanctot, Marc, Larson, Kate, Lever, Guy, Liu, Siqi, Marris, Luke, McKee, Kevin R., Muller, Paul, Perolat, Julien, Strub, Florian, Tacchetti, Andrea, Tarassov, Eugene, Wang, Zhe, Tuyls, Karl
The Game Theory & Multi-Agent team at DeepMind studies several aspects of multi-agent learning ranging from computing approximations to fundamental concepts in game theory to simulating social dilemmas in rich spatial environments and training 3-d hu
Externí odkaz:
http://arxiv.org/abs/2209.10958
Autor:
Muller, Paul, Elie, Romuald, Rowland, Mark, Lauriere, Mathieu, Perolat, Julien, Perrin, Sarah, Geist, Matthieu, Piliouras, Georgios, Pietquin, Olivier, Tuyls, Karl
The designs of many large-scale systems today, from traffic routing environments to smart grids, rely on game-theoretic equilibrium concepts. However, as the size of an $N$-player game typically grows exponentially with $N$, standard game theoretic a
Externí odkaz:
http://arxiv.org/abs/2208.10138
Autor:
Perolat, Julien, de Vylder, Bart, Hennes, Daniel, Tarassov, Eugene, Strub, Florian, de Boer, Vincent, Muller, Paul, Connor, Jerome T., Burch, Neil, Anthony, Thomas, McAleer, Stephen, Elie, Romuald, Cen, Sarah H., Wang, Zhe, Gruslys, Audrunas, Malysheva, Aleksandra, Khan, Mina, Ozair, Sherjil, Timbers, Finbarr, Pohlen, Toby, Eccles, Tom, Rowland, Mark, Lanctot, Marc, Lespiau, Jean-Baptiste, Piot, Bilal, Omidshafiei, Shayegan, Lockhart, Edward, Sifre, Laurent, Beauguerlange, Nathalie, Munos, Remi, Silver, David, Singh, Satinder, Hassabis, Demis, Tuyls, Karl
We introduce DeepNash, an autonomous agent capable of learning to play the imperfect information game Stratego from scratch, up to a human expert level. Stratego is one of the few iconic board games that Artificial Intelligence (AI) has not yet maste
Externí odkaz:
http://arxiv.org/abs/2206.15378
Autor:
Laurière, Mathieu, Perrin, Sarah, Pérolat, Julien, Girgin, Sertan, Muller, Paul, Élie, Romuald, Geist, Matthieu, Pietquin, Olivier
Non-cooperative and cooperative games with a very large number of players have many applications but remain generally intractable when the number of players increases. Introduced by Lasry and Lions, and Huang, Caines and Malham\'e, Mean Field Games (
Externí odkaz:
http://arxiv.org/abs/2205.12944
Autor:
Laurière, Mathieu, Perrin, Sarah, Girgin, Sertan, Muller, Paul, Jain, Ayush, Cabannes, Theophile, Piliouras, Georgios, Pérolat, Julien, Élie, Romuald, Pietquin, Olivier, Geist, Matthieu
Mean Field Games (MFGs) have been introduced to efficiently approximate games with very large populations of strategic agents. Recently, the question of learning equilibria in MFGs has gained momentum, particularly using model-free reinforcement lear
Externí odkaz:
http://arxiv.org/abs/2203.11973
Autor:
Muller, Paul, Rowland, Mark, Elie, Romuald, Piliouras, Georgios, Perolat, Julien, Lauriere, Mathieu, Marinier, Raphael, Pietquin, Olivier, Tuyls, Karl
Recent advances in multiagent learning have seen the introduction ofa family of algorithms that revolve around the population-based trainingmethod PSRO, showing convergence to Nash, correlated and coarse corre-lated equilibria. Notably, when the numb
Externí odkaz:
http://arxiv.org/abs/2111.08350
Autor:
Cabannes, Theophile, Lauriere, Mathieu, Perolat, Julien, Marinier, Raphael, Girgin, Sertan, Perrin, Sarah, Pietquin, Olivier, Bayen, Alexandre M., Goubault, Eric, Elie, Romuald
The recent emergence of navigational tools has changed traffic patterns and has now enabled new types of congestion-aware routing control like dynamic road pricing. Using the fundamental diagram of traffic flows - applied in macroscopic and mesoscopi
Externí odkaz:
http://arxiv.org/abs/2110.11943
Autor:
Ortega, Pedro A., Kunesch, Markus, Delétang, Grégoire, Genewein, Tim, Grau-Moya, Jordi, Veness, Joel, Buchli, Jonas, Degrave, Jonas, Piot, Bilal, Perolat, Julien, Everitt, Tom, Tallec, Corentin, Parisotto, Emilio, Erez, Tom, Chen, Yutian, Reed, Scott, Hutter, Marcus, de Freitas, Nando, Legg, Shane
The recent phenomenal success of language models has reinvigorated machine learning research, and large sequence models such as transformers are being applied to a variety of domains. One important problem class that has remained relatively elusive h
Externí odkaz:
http://arxiv.org/abs/2110.10819
Autor:
Perrin, Sarah, Laurière, Mathieu, Pérolat, Julien, Élie, Romuald, Geist, Matthieu, Pietquin, Olivier
Mean Field Games (MFGs) can potentially scale multi-agent systems to extremely large populations of agents. Yet, most of the literature assumes a single initial distribution for the agents, which limits the practical applications of MFGs. Machine Lea
Externí odkaz:
http://arxiv.org/abs/2109.09717