Výsledky vyhledávání

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Safe policy iteration: A monotonically improving approximate policy iteration approach

Autor: Metelli, A. M., Pirotta, M., Calandriello, D., Marcello Restelli

Publikováno v: Scopus-Elsevier

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::25bed31f042960a3723cf3235ad33850
http://hdl.handle.net/11311/1177647

Zobrazit plný text záznamu

Adversarial Attacks on Linear Contextual Bandits

Autor: Garcelon, E., Baptiste Rozière, Meunier, L., Tarbouriech, J., Teytaud, O., Lazaric, A., Pirotta, M.

Publikováno v: Scopus-Elsevier

Contextual bandit algorithms are applied in a wide range of domains, from advertising to recommender systems, from clinical trials to education. In many of these domains, malicious agents may have incentives to attack the bandit algorithm to induce i

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7417c371dc0b6a9b8b6ac5cc564f32cd
http://arxiv.org/abs/2002.03839

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Optimising Management Of Knee Osteoarthritis In Primary Care (Partner): A Cluster Randomised Controlled Trial

Autor: Bowden, J.L., Hunter, D.J., Hinman, R.S., Egerton, T., Briggs, A.M., Bunker, S.J., French, S.D., Pirotta, M., Shrestha, R., Schofield, D.J., Schuck, K., Zwar, N.A., Silva, S.M., Heller, G.Z., Bennell, K.L.

Publikováno v: In Osteoarthritis and Cartilage March 2023 31 Supplement 1:S29-S30

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Stochastic Variance-Reduced Policy Gradient

Autor: Matteo Papini, Binaghi, D., Canonaco, G., Pirotta, M., Restelli, M.

Publikováno v: ICML 2018-35th International Conference on Machine Learning
ICML 2018-35th International Conference on Machine Learning, Jul 2018, Stockholm, Sweden. pp.4026-4035
Scopus-Elsevier

International audience; In this paper, we propose a novel reinforcement-learning algorithm consisting in a stochastic variance-reduced version of policy gradient for solving Markov Decision Processes (MDPs). Stochastic variance-reduced gradient (SVRG

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1cea91ed5a2617e381666dfd767b5cad
https://hal.inria.fr/hal-01940394/document

Zobrazit plný text záznamu

Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning

Autor: Fruit, R., Pirotta, M., Lazaric, A., Ronald Ortner

Publikováno v: ICML 2018-The 35th International Conference on Machine Learning
ICML 2018-The 35th International Conference on Machine Learning, Jul 2018, Stockholm, Sweden. pp.1578-1586
Scopus-Elsevier

International audience; We introduce SCAL, an algorithm designed to perform efficient exploration-exploitation in any unknown weakly-communicating Markov decision process (MDP) for which an upper bound $c$ on the span of the optimal bias function is

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::93c12cb6236f966b8dadb4023b4a3d80
http://arxiv.org/abs/1802.04020

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání