Zobrazeno 1 - 6
of 6
pro vyhledávání: '"Raghuram Bharadwaj Diddigi"'
Publikováno v:
IEEE Transactions on Automatic Control. 67:4241-4247
Value iteration is a fixed point iteration technique utilized to obtain the optimal value function and policy in a discounted reward Markov Decision Process (MDP). Here, a contraction operator is constructed and applied repeatedly to arrive at the op
Publikováno v:
IEEE Control Systems Letters. 4:55-60
In a discounted reward Markov decision process (MDP), the objective is to find the optimal value function, i.e., the value function corresponding to an optimal policy. This problem reduces to solving a functional equation known as the Bellman equatio
Autor:
Annanya Pratap Singh Chauhan, Prishita Ray, Abhinava Sikdar, Sai Koti Reddy Danda, Shalabh Bhatnagar, Chanakya Ajit Ekbote, Shravan Nayak, Raghuram Bharadwaj Diddigi
Publikováno v:
ISGT-Europe
We consider the problem of energy management in microgrid networks. A microgrid is capable of generating a limited amount of energy from a renewable resource and is responsible for handling the demands of its dedicated customers. Owing to the variabl
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::df7ca6cb892b36b8a8cebf496249ef05
http://arxiv.org/abs/2002.02084
http://arxiv.org/abs/2002.02084
We consider the problem of two-player zero-sum games. This problem is formulated as a min-max Markov game in the literature. The solution of this game, which is the min-max payoff, starting from a given state is called the min-max value of the state.
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f9ee47ce796dd62c1772e1ddc69b8dc3
http://arxiv.org/abs/1906.06659
http://arxiv.org/abs/1906.06659
One of the popular measures of central tendency that provides better representation and interesting insights of the data compared to the other measures like mean and median is the metric mode. If the analytical form of the density function is known,
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cf50dfa084e3b67859b0e395ab8a4e7b
http://arxiv.org/abs/1902.03806
http://arxiv.org/abs/1902.03806
We consider the problem of tracking an intruder using a network of wireless sensors. For tracking the intruder at each instant, the optimal number and the right configuration of sensors has to be powered. As powering the sensors consumes energy, ther
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::82ee9ef214ae62db1ef5b86964070f29
http://arxiv.org/abs/1708.08113
http://arxiv.org/abs/1708.08113