Zobrazeno 1 - 10
of 3 314
pro vyhledávání: '"Large games"'
We consider regret minimization in repeated games with a very large number of actions. Such games are inherent in the setting of AI Safety via Debate \cite{irving2018ai}, and more generally games whose actions are language-based. Existing algorithms
Externí odkaz:
http://arxiv.org/abs/2312.04792
Autor:
Carmona, Guilherme1 (AUTHOR) g.carmona@surrey.ac.uk, Podczeck, Konrad2 (AUTHOR)
Publikováno v:
Economic Theory. Apr2022, Vol. 73 Issue 2/3, p679-694. 16p.
This paper presents a general closed graph property for (randomized strategy) Nash equilibrium correspondence in large games. In particular, we show that for any large game with a convergent sequence of fiinite-player games, the limit of any converge
Externí odkaz:
http://arxiv.org/abs/2011.06789
Finding approximate Nash equilibria in zero-sum imperfect-information games is challenging when the number of information states is large. Policy Space Response Oracles (PSRO) is a deep reinforcement learning algorithm grounded in game theory that is
Externí odkaz:
http://arxiv.org/abs/2006.08555
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
Economic Theory, 2019 Apr 01. 67(3), 497-523.
Externí odkaz:
https://www.jstor.org/stable/45200134
Autor:
Zhang, Brian Hu, Sandholm, Tuomas
In many game settings, the game is not explicitly given but is only accessible by playing it. While there have been impressive demonstrations in such settings, prior techniques have not offered safety guarantees, that is, guarantees on the game-theor
Externí odkaz:
http://arxiv.org/abs/2006.16387
Autor:
Timbers, Finbarr, Bard, Nolan, Lockhart, Edward, Lanctot, Marc, Schmid, Martin, Burch, Neil, Schrittwieser, Julian, Hubert, Thomas, Bowling, Michael
Researchers have demonstrated that neural networks are vulnerable to adversarial examples and subtle environment changes, both of which one can view as a form of distribution shift. To humans, the resulting errors can look like blunders, eroding trus
Externí odkaz:
http://arxiv.org/abs/2004.09677
Publikováno v:
In Journal of Economic Theory April 2022 201
This paper proposes a new equilibrium concept "robust perfect equilibrium" for non-cooperative games with a continuum of players, incorporating three types of perturbations. Such an equilibrium is shown to exist (in symmetric mixed strategies and in
Externí odkaz:
http://arxiv.org/abs/1912.12908