Zobrazeno 1 - 10
of 10
pro vyhledávání: '"Yonathan Efroni"'
Publikováno v:
AAAI 19-Thirty-Third AAAI Conference on Artificial Intelligence
AAAI 19-Thirty-Third AAAI Conference on Artificial Intelligence, Jan 2019, Honolulu, Hawai, United States
AAAI
Scopus-Elsevier
HAL
AAAI 19-Thirty-Third AAAI Conference on Artificial Intelligence, Jan 2019, Honolulu, Hawai, United States
AAAI
Scopus-Elsevier
HAL
Finite-horizon lookahead policies are abundantly used in Reinforcement Learning and demonstrate impressive empirical success. Usually, the lookahead policies are implemented with specific planning methods such as Monte Carlo Tree Search (e.g. in Alph
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::196f916985c9606299217617b4abc623
https://hal.inria.fr/hal-02273713
https://hal.inria.fr/hal-02273713
Publikováno v:
AAAI
Trust region policy optimization (TRPO) is a popular and empirically successful policy search algorithm in Reinforcement Learning (RL) in which a surrogate problem, that restricts consecutive policies to be 'close' to one another, is iteratively solv
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e79450302c758d5c1bdc9a8a9cc3fffc
Publikováno v:
NeurIPS 2018-Thirty-second Conference on Neural Information Processing Systems
NeurIPS 2018-Thirty-second Conference on Neural Information Processing Systems, Dec 2018, Montréal, Canada
Scopus-Elsevier
HAL
NeurIPS 2018-Thirty-second Conference on Neural Information Processing Systems, Dec 2018, Montréal, Canada
Scopus-Elsevier
HAL
Multiple-step lookahead policies have demonstrated high empirical competence in Reinforcement Learning, via the use of Monte Carlo Tree Search or Model Predictive Control. In a recent work \cite{efroni2018beyond}, multiple-step greedy policies and th
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f028381a3aec07d15398adac3b208d6d
https://inria.hal.science/hal-01927962/file/approximate_online_cr_final.pdf
https://inria.hal.science/hal-01927962/file/approximate_online_cr_final.pdf
Publikováno v:
Physical Review Letters
We show that carbon nanotubes (CNT) can be driven through a topological phase transition using either strain or a magnetic field. This can naturally lead to Jackiw-Rebbi soliton states carrying fractionalized charges, similar to those found in a doma
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::886355c82bf4c8d15871427453cc82bd
http://arxiv.org/abs/1608.05976
http://arxiv.org/abs/1608.05976
Autor:
LANDERS, MATTHEW1 mlanders@virginia.edu, DORYAB, AFSANEH1 ad4ks@virginia.edu
Publikováno v:
ACM Computing Surveys. 2023 Suppl14s, Vol. 55, p1-31. 31p.
Publikováno v:
HAL
EWRL 2018-14th European workshop on Reinforcement Learning
EWRL 2018-14th European workshop on Reinforcement Learning, Oct 2018, Lille, France
EWRL 2018-14th European workshop on Reinforcement Learning
EWRL 2018-14th European workshop on Reinforcement Learning, Oct 2018, Lille, France
International audience; Anderson (1965) acceleration is an old and simple method for accelerating the computation of a fixed point. However, as far as we know and quite surprisingly, it has never been applied to dynamic programming or reinforcement l
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::be5e4a4d6112bcc463e899b0fe963490
https://hal.inria.fr/hal-01927977
https://hal.inria.fr/hal-01927977
Publikováno v:
ACM Computing Surveys; Jun2024, Vol. 56 Issue 6, p1-39, 39p
Autor:
Du, Simon S., Krishnamurthy, Akshay, Jiang, Nan, Agarwal, Alekh, Dudík, Miroslav, Langford, John
We study the exploration problem in episodic MDPs with rich observations generated from a small number of latent states. Under certain identifiability assumptions, we demonstrate how to estimate a mapping from the observations to latent states induct
Externí odkaz:
http://arxiv.org/abs/1901.09018
Publikováno v:
AI Magazine; Spring2019, Vol. 40 Issue 1, p83-92, 7p
Artificial intelligence, or AI, now affects the day-to-day life of almost everyone on the planet, and continues to be a perennial hot topic in the news. This book presents the proceedings of ECAI 2023, the 26th European Conference on Artificial Intel