Výsledky vyhledávání - "Sinclair, Sean A."

Report

The Data-Driven Censored Newsvendor Problem

Autor: Hssaine, Chamsi, Sinclair, Sean R.

We study a censored variant of the data-driven newsvendor problem, where the decision-maker must select an ordering quantity that minimizes expected overage and underage costs based only on offline censored sales data, rather than historical demand r

Externí odkaz: http://arxiv.org/abs/2412.01763

Zobrazit plný text záznamu

Report

Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning

Autor: Wan, Jia, Sinclair, Sean R., Shah, Devavrat, Wainwright, Martin J.

We study a class of structured Markov Decision Processes (MDPs) known as Exo-MDPs. They are characterized by a partition of the state space into two components: the exogenous states evolve stochastically in a manner not affected by the agent's action

Externí odkaz: http://arxiv.org/abs/2409.14557

Zobrazit plný text záznamu

Report

Multi-Objective LQR with Linear Scalarization

Autor: Jadbabaie, Ali, Shah, Devavrat, Sinclair, Sean R.

The framework of decision-making, modeled as a Markov Decision Process (MDP), typically assumes a single objective. However, most practical scenarios involve considering tradeoffs between multiple objectives. With that as the motivation, we consider

Externí odkaz: http://arxiv.org/abs/2408.04488

Zobrazit plný text záznamu

Report

Online Fair Allocation of Perishable Resources

Autor: Banerjee, Siddhartha, Hssaine, Chamsi, Sinclair, Sean R.

We consider a practically motivated variant of the canonical online fair allocation problem: a decision-maker has a budget of perishable resources to allocate over a fixed number of rounds. Each round sees a random number of arrivals, and the decisio

Externí odkaz: http://arxiv.org/abs/2406.02402

Zobrazit plný text záznamu

Report

Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits

Autor: Banerjee, Siddhartha, Sinclair, Sean R., Tambe, Milind, Xu, Lily, Yu, Christina Lee

Most real-world deployments of bandit algorithms exist somewhere in between the offline and online set-up, where some historical data is available upfront and additional data is collected dynamically online. How best to incorporate historical data to

Externí odkaz: http://arxiv.org/abs/2210.00025

Zobrazit plný text záznamu

Report

Hindsight Learning for MDPs with Exogenous Inputs

Autor: Sinclair, Sean R., Frujeri, Felipe, Cheng, Ching-An, Marshall, Luke, Barbalho, Hugo, Li, Jingling, Neville, Jennifer, Menache, Ishai, Swaminathan, Adith

Many resource management problems require sequential decision-making under uncertainty, where the only uncertainty affecting the decision outcomes are exogenous variables outside the control of the decision-maker. We model these problems as Exo-MDPs

Externí odkaz: http://arxiv.org/abs/2207.06272

Zobrazit plný text záznamu

Report

Adaptive Discretization in Online Reinforcement Learning

Autor: Sinclair, Sean R., Banerjee, Siddhartha, Yu, Christina Lee

Discretization based approaches to solving online reinforcement learning problems have been studied extensively in practice on applications ranging from resource allocation to cache management. Two major questions in designing discretization-based al

Externí odkaz: http://arxiv.org/abs/2110.15843

Zobrazit plný text záznamu

Report

Sequential Fair Allocation: Achieving the Optimal Envy-Efficiency Tradeoff Curve

Autor: Sinclair, Sean R., Jain, Gauri, Banerjee, Siddhartha, Yu, Christina Lee

We consider the problem of dividing limited resources to individuals arriving over $T$ rounds. Each round has a random number of individuals arrive, and individuals can be characterized by their type (i.e. preferences over the different resources). A

Externí odkaz: http://arxiv.org/abs/2105.05308

Zobrazit plný text záznamu

Report

Sequential Fair Allocation of Limited Resources under Stochastic Demands

Autor: Sinclair, Sean R., Jain, Gauri, Banerjee, Siddhartha, Yu, Christina Lee

We consider the problem of dividing limited resources between a set of agents arriving sequentially with unknown (stochastic) utilities. Our goal is to find a fair allocation - one that is simultaneously Pareto-efficient and envy-free. When all utili

Externí odkaz: http://arxiv.org/abs/2011.14382

Zobrazit plný text záznamu

Report

Adaptive Discretization for Model-Based Reinforcement Learning

Autor: Sinclair, Sean R., Wang, Tianyu, Jain, Gauri, Banerjee, Siddhartha, Yu, Christina Lee

We introduce the technique of adaptive discretization to design an efficient model-based episodic reinforcement learning algorithm in large (potentially continuous) state-action spaces. Our algorithm is based on optimistic one-step value iteration ex

Externí odkaz: http://arxiv.org/abs/2007.00717

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání