Search results - "Gast, Nicolas"

Report

Model Predictive Control is Almost Optimal for Restless Bandit

Authors: Gast, Nicolas; Narasimha, Dheeraj

We consider the discrete time infinite horizon average reward restless Markovian bandit (RMAB) problem. We propose a \emph{model predictive control} based non-stationary policy with a rolling computational horizon $\tau$. At each time-slot, this poli

External link: http://arxiv.org/abs/2410.06307

Report

Prophet Inequalities: Competing with the Top $\ell$ Items is Easy

Authors: Molina, Mathieu; Gast, Nicolas; Loiseau, Patrick; Perchet, Vianney

We explore a novel variant of the classical prophet inequality problem, where the values of a sequence of items are drawn i.i.d. from some distribution, and an online decision maker must select one item irrevocably. We establish that the competitive

External link: http://arxiv.org/abs/2408.07616

Report

Computing the Bias of Constant-step Stochastic Approximation with Markovian Noise

Authors: Allmeier, Sebastian; Gast, Nicolas

We study stochastic approximation algorithms with Markovian noise and constant step-size $\alpha$. We develop a method based on infinitesimal generator comparisons to study the bias of the algorithm, which is the expected difference between $\theta_n

External link: http://arxiv.org/abs/2405.14285

Report

Accuracy of the Graphon Mean Field Approximation for Interacting Particle Systems

Authors: Allmeier, Sebastian; Gast, Nicolas

We consider a system of $N$ particles whose interactions are characterized by a (weighted) graph $G^N$. Each particle is a node of the graph with an internal state. The state changes according to Markovian dynamics that depend on the states and conne

External link: http://arxiv.org/abs/2405.08623

Report

Trading-off Price for Data Quality to Achieve Fair Online Allocation

Authors: Molina, Mathieu; Gast, Nicolas; Loiseau, Patrick; Perchet, Vianney

We consider the problem of online allocation subject to a long-term fairness penalty. Contrary to existing works, however, we do not assume that the decision-maker observes the protected attributes -- which is often unrealistic in practice. Instead t

External link: http://arxiv.org/abs/2306.13440

Report

Decentralized Model-free Reinforcement Learning in Stochastic Games with Average-reward Objective

Authors: Cravic, Romain; Gast, Nicolas; Gaujal, Bruno

We propose the first model-free algorithm that achieves low regret performance for decentralized learning in two-player zero-sum tabular stochastic games with infinite-horizon average-reward objective. In decentralized learning, the learning agent co

External link: http://arxiv.org/abs/2301.05630

Report

Bias and Refinement of Multiscale Mean Field Models

Authors: Allmeier, Sebastian; Gast, Nicolas

Mean field approximation is a powerful technique which has been used in many settings to study large-scale stochastic systems. In the case of two-timescale systems, the approximation is obtained by a combination of scaling arguments and the use of th

External link: http://arxiv.org/abs/2211.11382

Report

Reoptimization Nearly Solves Weakly Coupled Markov Decision Processes

Authors: Gast, Nicolas; Gaujal, Bruno; Yan, Chen

We propose a new policy, called the LP-update policy, to solve finite horizon weakly-coupled Markov decision processes. The latter can be seen as multi-constraint multi-action bandits, and generalize the classical restless bandit problems. Our soluti

External link: http://arxiv.org/abs/2211.01961

Report

Fairness in Selection Problems with Strategic Candidates

Authors: Emelianov, Vitalii; Gast, Nicolas; Loiseau, Patrick

To better understand discriminations and the effect of affirmative actions in selection problems (e.g., college admission or hiring), a recent line of research proposed a model based on differential variance. This model assumes that the decision-make

External link: http://arxiv.org/abs/2205.12204

Report

Testing Indexability and Computing Whittle and Gittins Index in Subcubic Time

Authors: Gast, Nicolas; Gaujal, Bruno; Khun, Kimang

Whittle index is a generalization of Gittins index that provides very efficient allocation rules for restless multi-armed bandits. In this work, we develop an algorithm to test the indexability and compute the Whittle indices of any finite-state rest

External link: http://arxiv.org/abs/2203.05207
