Výsledky vyhledávání

Akademický článek

Explainability-based Trust Algorithm for electricity price forecasting models

Autor: Leena Heistrene, Ram Machlev, Michael Perl, Juri Belikov, Dmitry Baimel, Kfir Levy, Shie Mannor, Yoash Levron

Publikováno v: Energy and AI, Vol 14, Iss , Pp 100259- (2023)

Advanced machine learning (ML) algorithms have outperformed traditional approaches in various forecasting applications, especially electricity price forecasting (EPF). However, the prediction accuracy of ML reduces substantially if the input data is

Externí odkaz: https://doaj.org/article/11eb963bb8844d9caeb7ef96ebe7f27f

Zobrazit plný text záznamu

Multi-User Communication Networks: A Coordinated Multi-Armed Bandit Approach

Autor: Orly Avner, Shie Mannor

Publikováno v: IEEE/ACM Transactions on Networking. 27:2192-2207

Communication networks shared by many users are a widespread challenge nowadays. In this paper we address several aspects of this challenge simultaneously: learning unknown stochastic network characteristics, sharing resources with other users while

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a2d5dc1cfff174b0a3294ebc00290bb1
https://doi.org/10.1109/tnet.2019.2935043

Zobrazit plný text záznamu

Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning

Autor: Roy Zohar, Shie Mannor, Guy Tennenholtz

Cooperative multi-agent reinforcement learning (MARL) faces significant scalability issues due to state and action spaces that are exponentially large in the number of agents. As environments grow in size, effective credit assignment becomes increasi

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::862809da7e566e4ff5eaf02bc0f25f70
http://arxiv.org/abs/2109.10632

Zobrazit plný text záznamu

On the Volatility of Optimal Control Policies of a Class of Linear Quadratic Regulators

Autor: Arman C. Kizilkale, Avi Mohan, Shie Mannor

Publikováno v: ACC

It is well known that highly volatile control laws, while theoretically optimal for certain systems, are undesirable from an engineering perspective, being generally deleterious to the controlled system. In this article we are concerned with the temp

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::ae549ea3ee50366957cd54492d83fa6e
https://doi.org/10.23919/acc50511.2021.9482645

Zobrazit plný text záznamu

On-Line Learning of Linear Dynamical Systems: Exponential Forgetting in Kalman Filters

Autor: Mark Kozdoba, Tigran T. Tchrakian, Jakub Marecek, Shie Mannor

Publikováno v: AAAI
Scopus-Elsevier

The Kalman filter is a key tool for time-series forecasting and analysis. We show that the dependence of a prediction of Kalman filter on the past is decaying exponentially, whenever the process noise is non-degenerate. Therefore, Kalman filter may b

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::82b13725b93bce206bf294830a69dab7
https://doi.org/10.1609/aaai.v33i01.33014098

Zobrazit plný text záznamu

Chance-Constrained Outage Scheduling Using a Machine Learning Proxy

Autor: Gal Dalal, Louis Wehenkel, Shie Mannor, Elad Gilboa

Publikováno v: IEEE Transactions on Power Systems. 34:2528-2540

Outage scheduling aims at defining, over a horizon of several months to years, when different components needing maintenance should be taken out of operation. Its objective is to minimize operation-cost expectation while satisfying reliability-relate

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1e9af80e2b6b889cafa6307d6f97699e
https://doi.org/10.1109/tpwrs.2018.2889237

Zobrazit plný text záznamu

Robust Value Iteration for Continuous Control Tasks

Autor: Dieter Fox, Jan Peters, Michael Lutter, Animesh Garg, Shie Mannor

Publikováno v: Robotics: Science and Systems

When transferring a control policy from simulation to a physical system, the policy needs to be robust to variations in the dynamics to perform well. Commonly, the optimal policy overfits to the approximate model and the corresponding state-distribut

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5de45951b44f926f4eaa1514572e26d2

Zobrazit plný text záznamu

Continuous-Time Fitted Value Iteration for Robust Policies

Autor: Michael Lutter, Boris Belousov, Shie Mannor, Dieter Fox, Animesh Garg, Jan Peters

Solving the Hamilton-Jacobi-Bellman equation is important in many domains including control, robotics and economics. Especially for continuous control, solving this differential equation and its extension the Hamilton-Jacobi-Isaacs equation, is impor

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::dc2dbb05aeabe501dc78de337701ac30

Zobrazit plný text záznamu

Online Apprenticeship Learning

Autor: Lior Shani, Tom Zahavy, Shie Mannor

In Apprenticeship Learning (AL), we are given a Markov Decision Process (MDP) without access to the cost function. Instead, we observe trajectories sampled by an expert that acts according to some policy. The goal is to find a policy that matches the

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::eb7cbf20274ad8006bedd3db901e1ef3

Zobrazit plný text záznamu

Reinforcement Learning for Datacenter Congestion Control

Autor: Chen Tessler, Yuval Shpigelman, Gal Dalal, Amit Mandelbaum, Doron Haritan Kazakov, Benjamin Fuhrer, Gal Chechik, Shie Mannor

We approach the task of network congestion control in datacenters using Reinforcement Learning (RL). Successful congestion control algorithms can dramatically improve latency and overall network throughput. Until today, no such learning-based algorit

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f8380d7cc8c18eb383a7f65ade54b1ad

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání