Výsledky vyhledávání - "Raghuram Bharadwaj Diddigi"

Generalized Second-Order Value Iteration in Markov Decision Processes

Autor: Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar

Publikováno v: IEEE Transactions on Automatic Control. 67:4241-4247

Value iteration is a fixed point iteration technique utilized to obtain the optimal value function and policy in a discounted reward Markov Decision Process (MDP). Here, a contraction operator is constructed and applied repeatedly to arrive at the op

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c8da96240508ffeedc4ef888677af7cd
https://doi.org/10.1109/tac.2021.3112851

Zobrazit plný text záznamu

Successive Over-Relaxation ${Q}$ -Learning

Autor: Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi, Shalabh Bhatnagar

Publikováno v: IEEE Control Systems Letters. 4:55-60

In a discounted reward Markov decision process (MDP), the objective is to find the optimal value function, i.e., the value function corresponding to an optimal policy. This problem reduces to solving a functional equation known as the Bellman equatio

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::71f546c335ffb721d79178b350e6e552
https://doi.org/10.1109/lcsys.2019.2921158

Zobrazit plný text záznamu

A Stochastic Game Framework for Efficient Energy Management in Microgrid Networks

Autor: Annanya Pratap Singh Chauhan, Prishita Ray, Abhinava Sikdar, Sai Koti Reddy Danda, Shalabh Bhatnagar, Chanakya Ajit Ekbote, Shravan Nayak, Raghuram Bharadwaj Diddigi

Publikováno v: ISGT-Europe

We consider the problem of energy management in microgrid networks. A microgrid is capable of generating a limited amount of energy from a renewable resource and is responsible for handling the demands of its dedicated customers. Owing to the variabl

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::df7ca6cb892b36b8a8cebf496249ef05
http://arxiv.org/abs/2002.02084

Zobrazit plný text záznamu

A Generalized Minimax Q-learning Algorithm for Two-Player Zero-Sum Stochastic Games

Autor: Raghuram Bharadwaj Diddigi, Chandramouli Kamanchi, Shalabh Bhatnagar

We consider the problem of two-player zero-sum games. This problem is formulated as a min-max Markov game in the literature. The solution of this game, which is the min-max payoff, starting from a given state is called the min-max value of the state.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f9ee47ce796dd62c1772e1ddc69b8dc3
http://arxiv.org/abs/1906.06659

Zobrazit plný text záznamu

An Online Sample Based Method for Mode Estimation using ODE Analysis of Stochastic Approximation Algorithms

Autor: K J Prabuchandran, Shalabh Bhatnagar, Chandramouli Kamanchi, Raghuram Bharadwaj Diddigi

One of the popular measures of central tendency that provides better representation and interesting insights of the data compared to the other measures like mean and median is the metric mode. If the analytical form of the density function is known,

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cf50dfa084e3b67859b0e395ab8a4e7b
http://arxiv.org/abs/1902.03806

Zobrazit plný text záznamu

Novel Sensor Scheduling Scheme for Intruder Tracking in Energy Efficient Sensor Networks

Autor: Shalabh Bhatnagar, K J Prabuchandran, Raghuram Bharadwaj Diddigi

We consider the problem of tracking an intruder using a network of wireless sensors. For tracking the intruder at each instant, the optimal number and the right configuration of sensors has to be powered. As powering the sensors consumes energy, ther

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::82ee9ef214ae62db1ef5b86964070f29
http://arxiv.org/abs/1708.08113

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání