Výsledky vyhledávání - "Mete, Akshay"

Report

Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs

Autor: Hung, Yu-Heng, Hsieh, Ping-Chun, Mete, Akshay, Kumar, P. R.

We consider the infinite-horizon linear Markov Decision Processes (MDPs), where the transition probabilities of the dynamic model can be linearly parameterized with the help of a predefined low-dimensional feature mapping. While the existing regressi

Externí odkaz: http://arxiv.org/abs/2310.11515

Zobrazit plný text záznamu

Report

Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs

Autor: Singh, Rahul, Mete, Akshay, Kar, Avik, Kumar, P. R.

Minimum variance controllers have been employed in a wide-range of industrial applications. A key challenge experienced by many adaptive controllers is their poor empirical performance in the initial stages of learning. In this paper, we address the

Externí odkaz: http://arxiv.org/abs/2305.16974

Zobrazit plný text záznamu

Report

Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems

Autor: Mete, Akshay, Singh, Rahul, Kumar, P. R.

We consider the problem of controlling an unknown stochastic linear system with quadratic costs - called the adaptive LQ control problem. We re-examine an approach called ''Reward Biased Maximum Likelihood Estimate'' (RBMLE) that was proposed more th

Externí odkaz: http://arxiv.org/abs/2201.10542

Zobrazit plný text záznamu

Report

Reward Biased Maximum Likelihood Estimation for Reinforcement Learning

Autor: Mete, Akshay, Singh, Rahul, Liu, Xi, Kumar, P. R.

The Reward-Biased Maximum Likelihood Estimate (RBMLE) for adaptive control of Markov chains was proposed to overcome the central obstacle of what is variously called the fundamental "closed-identifiability problem" of adaptive control, the "dual cont

Externí odkaz: http://arxiv.org/abs/2011.07738

Zobrazit plný text záznamu

Report

Partial Server Pooling in Redundancy Systems

Autor: Mete, Akshay, Manjunath, D., Nair, Jayakrishnan, Prabhu, Balakrishna

Partial sharing allows providers to possibly pool a fraction of their resources when full pooling is not beneficial to them. Recent work in systems without sharing has shown that redundancy can improve performance considerably. In this paper, we comb

Externí odkaz: http://arxiv.org/abs/1906.03587

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání