Showing 1 - 8 of 8 for search: '"Naumaan Nayyar"'
Published in:
IEEE Transactions on Control of Network Systems. 5:597-606
We consider the problem of learning in single-player and multiplayer multiarmed bandit models. Bandit problems are classes of online learning problems that capture exploration versus exploitation tradeoffs. In a multiarmed bandit model, players can p
Published in:
IEEE Transactions on Control of Network Systems. 5:653-663
We consider optimal control of decentralized LQG problems for plants controlled by two players having asymmetric information sharing patterns between them. In one scenario, players are assumed to have a bidirectional error-free, unlimited rate commun
Published in:
IEEE Transactions on Information Theory. 60:2331-2345
We consider the problem of distributed online learning with multiple players in multiarmed bandit (MAB) models. Each player can pick among multiple arms. When a player picks an arm, it gets a reward. We consider both independent identically distribut
Published in:
ACC
We consider optimal decentralized LQG control for a plant with nested structure controlled by two players receiving partial output observations from the plant. A unidirectional one-step delayed error-free communication channel is assumed to exist bet
Published in:
CDC
We consider the problem of distributed online learning with multiple players in multi-armed bandit (MAB) models. Each player can pick among multiple arms. When a player picks an arm, it gets a reward. We consider both i.i.d. reward model and Markovi
Published in:
Allerton Conference
We consider the decentralized multi-armed bandit problem with distinct arms for each player. Each player can pick one arm at each time instant and can get a random reward from an unknown distribution with an unknown mean. The arms give different rew
Published in:
Allerton
We consider the following learning problem motivated by opportunistic spectrum access in cognitive radio networks. There are N independent Gilbert-Elliott channels with possibly non-identical transition matrices. It is desired to have an online polic
Published in:
Journal of Surgical Research. 186:496