Zobrazeno 1 - 10
of 121
pro vyhledávání: '"Busic, Ana"'
The expected regret of any reinforcement learning algorithm is lower bounded by $\Omega\left(\sqrt{DXAT}\right)$ for undiscounted returns, where $D$ is the diameter of the Markov decision process, $X$ the size of the state space, $A$ the size of the
Externí odkaz:
http://arxiv.org/abs/2406.04766
Publikováno v:
2023 IEEE 62nd Conference on Decision and Control (CDC), 2023
Utilities have introduced demand charges to encourage customers to reduce their demand peaks, since a high peak may cause very high costs for both the utility and the consumer. We herein study the bill minimization problem for customers equipped with
Externí odkaz:
http://arxiv.org/abs/2402.07525
We consider a stochastic matching model with a general compatibility graph, as introduced in \cite{MaiMoy16}. We prove that most common matching policies (including FCFM, priorities and random) satisfy a particular sub-additive property, which we exp
Externí odkaz:
http://arxiv.org/abs/2305.00187
Optimal transport is now a standard tool for solving many problems in statistics and machine learning. The optimal "transport of probability measures" is also a recurring theme in stochastic control and distributed control, where in the latter applic
Externí odkaz:
http://arxiv.org/abs/2208.01958
To mitigate issues related to the growth of variable smart loads and distributed generation, distribution system operators (DSO) now make it binding for prosumers with inverters to operate under pre-set rules. In particular, the maximum active and re
Externí odkaz:
http://arxiv.org/abs/2207.10248
Stochastic dynamic matching problems have recently gained attention in the stochastic-modeling community due to their diverse applications, such as supply-chain management and kidney exchange programs. In this paper, we study a matching problem where
Externí odkaz:
http://arxiv.org/abs/2112.14457
A collection of thermostatically controlled loads (TCLs) -- such as air conditioners and water heaters -- can vary their power consumption within limits to help the balancing authority of a power grid maintain demand supply balance. Doing so requires
Externí odkaz:
http://arxiv.org/abs/2108.05840
We consider a peer-to-peer electricity market, where agents hold private information that they might not want to share. The problem is modeled as a noncooperative communication game, which takes the form of a Generalized Nash Equilibrium Problem, whe
Externí odkaz:
http://arxiv.org/abs/2101.06922
Thermostatically controlled loads (TCLs) have the potential to be a valuable resource for the Balancing Authority (BA) of the future. Examples of TCLs include household appliances such as air conditioners, water heaters, and refrigerators. Since the
Externí odkaz:
http://arxiv.org/abs/2009.12960
We study the performance of general dynamic matching models. This model is defined by a connected graph, where nodes represent the class of items and the edges the compatibilities between items. Items of different classes arrive one by one to the sys
Externí odkaz:
http://arxiv.org/abs/2009.10009