Showing 1 - 7 of 7 for search: '"Kaledin, Maxim"'
Policy-gradient methods in Reinforcement Learning (RL) are highly general and widely applied in practice, but their performance suffers from the high variance of the gradient estimate. Several procedures have been proposed to reduce it, including actor-criti…
External link:
http://arxiv.org/abs/2206.06827
The linear two-timescale stochastic approximation (SA) scheme is an important class of algorithms which has become popular in reinforcement learning (RL), particularly for the policy evaluation problem. Recently, a number of works have been devoted to es…
External link:
http://arxiv.org/abs/2002.01268
Author:
Belomestny, Denis1,2 (AUTHOR), Kaledin, Maxim2 (AUTHOR), Schoenmakers, John3 (AUTHOR) schoenma@wias-berlin.de
Published in:
Mathematical Finance. Oct 2020, Vol. 30, Issue 4, p1591-1616. 26p.
In this article we propose a Weighted Stochastic Mesh (WSM) algorithm for approximating the value of discrete- and continuous-time optimal stopping problems. We prove that in the discrete case the WSM algorithm leads to semi-tractability of the corre…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1c5303e57b0a64b45e1dc859cacb7843
Author:
Efimov, Alexander I.1,2 (AUTHOR) efimov@mccme.ru
Published in:
Inventiones Mathematicae. Nov 2020, Vol. 222, Issue 2, p667-694. 28p.
Author:
Efimov, Alexander I.
Published in:
Journal of the European Mathematical Society (EMS Publishing); 2020, Vol. 22, Issue 9, p2879-2942. 64p.
Author:
Ademir Hujdurović, Klavdija Kutnar, Dragan Marušič, Štefko Miklavič, Tomaž Pisanski, Primož Šparl
The European Congress of Mathematics, held every four years, is a well-established major international mathematical event. Following those in Paris (1992), Budapest (1996), Barcelona (2000), Stockholm (2004), Amsterdam (2008), Kraków (2012), and B…