Výsledky vyhledávání - "Yonathan Efroni"

How to Combine Tree-Search Methods in Reinforcement Learning

Autor: Shie Mannor, Bruno Scherrer, Yonathan Efroni, Gal Dalal

Publikováno v: AAAI 19-Thirty-Third AAAI Conference on Artificial Intelligence
AAAI 19-Thirty-Third AAAI Conference on Artificial Intelligence, Jan 2019, Honolulu, Hawai, United States
AAAI
Scopus-Elsevier
HAL

Finite-horizon lookahead policies are abundantly used in Reinforcement Learning and demonstrate impressive empirical success. Usually, the lookahead policies are implemented with specific planning methods such as Monte Carlo Tree Search (e.g. in Alph

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::196f916985c9606299217617b4abc623
https://hal.inria.fr/hal-02273713

Zobrazit plný text záznamu

Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs

Autor: Shie Mannor, Yonathan Efroni, Lior Shani

Publikováno v: AAAI

Trust region policy optimization (TRPO) is a popular and empirically successful policy search algorithm in Reinforcement Learning (RL) in which a surrogate problem, that restricts consecutive policies to be 'close' to one another, is iteratively solv

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e79450302c758d5c1bdc9a8a9cc3fffc

Zobrazit plný text záznamu

Multiple-step greedy policies in online and approximate reinforcement learning

Autor: Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor

Publikováno v: NeurIPS 2018-Thirty-second Conference on Neural Information Processing Systems
NeurIPS 2018-Thirty-second Conference on Neural Information Processing Systems, Dec 2018, Montréal, Canada
Scopus-Elsevier
HAL

Multiple-step lookahead policies have demonstrated high empirical competence in Reinforcement Learning, via the use of Monte Carlo Tree Search or Model Predictive Control. In a recent work \cite{efroni2018beyond}, multiple-step greedy policies and th

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f028381a3aec07d15398adac3b208d6d
https://inria.hal.science/hal-01927962/file/approximate_online_cr_final.pdf

Zobrazit plný text záznamu

Topological transitions and fractional charges induced by strain and magnetic field in carbon nanotubes

Autor: Shahal Ilani, Erez Berg, Yonathan Efroni

Publikováno v: Physical Review Letters

We show that carbon nanotubes (CNT) can be driven through a topological phase transition using either strain or a magnetic field. This can naturally lead to Jackiw-Rebbi soliton states carrying fractionalized charges, similar to those found in a doma

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::886355c82bf4c8d15871427453cc82bd
http://arxiv.org/abs/1608.05976

Zobrazit plný text záznamu

Akademický článek

Deep Reinforcement Learning Verification: A Survey.

Autor: LANDERS, MATTHEW¹ mlanders@virginia.edu, DORYAB, AFSANEH¹ ad4ks@virginia.edu

Publikováno v: ACM Computing Surveys. 2023 Suppl14s, Vol. 55, p1-31. 31p.

Zobrazit plný text záznamu

Convergence of Online and Approximate Multiple-Step Lookahead Policy Iteration

Autor: Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor

Publikováno v: HAL
EWRL 2018-14th European workshop on Reinforcement Learning
EWRL 2018-14th European workshop on Reinforcement Learning, Oct 2018, Lille, France

International audience; Anderson (1965) acceleration is an old and simple method for accelerating the computation of a fixed point. However, as far as we know and quite surprisingly, it has never been applied to dynamic programming or reinforcement l

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::be5e4a4d6112bcc463e899b0fe963490
https://hal.inria.fr/hal-01927977

Zobrazit plný text záznamu

Akademický článek

Security and Privacy Issues in Deep Reinforcement Learning: Threats and Countermeasures.

Autor: Mo, Kanghua, Ye, Peigen, Ren, Xiaojun, Wang, Shaowei, Li, Wenjun, Li, Jin

Publikováno v: ACM Computing Surveys; Jun2024, Vol. 56 Issue 6, p1-39, 39p

Zobrazit plný text záznamu

Report

Provably efficient RL with Rich Observations via Latent State Decoding

Autor: Du, Simon S., Krishnamurthy, Akshay, Jiang, Nan, Agarwal, Alekh, Dudík, Miroslav, Langford, John

We study the exploration problem in episodic MDPs with rich observations generated from a small number of latent states. Under certain identifiability assumptions, we demonstrate how to estimate a mapping from the observations to latent states induct

Externí odkaz: http://arxiv.org/abs/1901.09018

Zobrazit plný text záznamu

Akademický článek

Congratulations to the 2019 AAAI Award Winners!

Publikováno v: AI Magazine; Spring2019, Vol. 40 Issue 1, p83-92, 7p

Zobrazit plný text záznamu

Elektronická kniha

ECAI 2023 : 26th European Conference on Artificial Intelligence, September 30 – October 4, 2023, Kraków, Poland – Including 12th Conference on Prestigious Applications of Intelligent Systems (PAIS 2023)

Autor: K. Gal, A. Nowé, G.J. Nalepa

Artificial intelligence, or AI, now affects the day-to-day life of almost everyone on the planet, and continues to be a perennial hot topic in the news. This book presents the proceedings of ECAI 2023, the 26th European Conference on Artificial Intel

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání