Search results - "Naumaan Nayyar"

On Regret-Optimal Learning in Decentralized Multiplayer Multiarmed Bandits

Authors: Rahul Jain, Dileep Kalathil, Naumaan Nayyar

Published in: IEEE Transactions on Control of Network Systems. 5:597-606

We consider the problem of learning in single-player and multiplayer multiarmed bandit models. Bandit problems are classes of online learning problems that capture exploration versus exploitation tradeoffs. In a multiarmed bandit model, players can p

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::36cef13328c2f62a5a482c907418f467
https://doi.org/10.1109/tcns.2016.2635380

Optimal Decentralized Control With Asymmetric One-Step Delayed Information Sharing

Authors: Naumaan Nayyar, Rahul Jain, Dileep Kalathil

Published in: IEEE Transactions on Control of Network Systems. 5:653-663

We consider optimal control of decentralized LQG problems for plants controlled by two players having asymmetric information sharing patterns between them. In one scenario, players are assumed to have a bidirectional error-free, unlimited rate commun

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a5b5cb6b9d58941c2d120230aec4025f
https://doi.org/10.1109/tcns.2016.2641802

Decentralized Learning for Multiplayer Multiarmed Bandits

Authors: Naumaan Nayyar, Dileep Kalathil, Rahul Jain

Published in: IEEE Transactions on Information Theory. 60:2331-2345

We consider the problem of distributed online learning with multiple players in multiarmed bandit (MAB) models. Each player can pick among multiple arms. When a player picks an arm, it gets a reward. We consider both independent identically distribut

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::1ebb948de56b3a80e2e795af28257bbe
https://doi.org/10.1109/tit.2014.2302471

Optimal decentralized control in unidirectional one-step delayed sharing pattern with partial output feedback

Authors: Rahul Jain, Naumaan Nayyar, Dileep Kalathil

Published in: ACC

We consider optimal decentralized LQG control for a plant with nested structure controlled by two players receiving partial output observations from the plant. A unidirectional one-step delayed error-free communication channel is assumed to exist bet

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::ac70d9b4c921a799364487c35f041b22
https://doi.org/10.1109/acc.2014.6858820

Decentralized learning for multi-player multi-armed bandits

Authors: Naumaan Nayyar, Dileep Kalathil, Rahul Jain

Published in: CDC

We consider the problem of distributed online learning with multiple players in multi-armed bandit (MAB) models. Each player can pick among multiple arms. When a player picks an arm, it gets a reward. We consider both i.i.d. reward model and Markovi

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6dd9900e77b1462ea8213d971ae61879
https://doi.org/10.1109/cdc.2012.6426587

Multi-player multi-armed bandits: Decentralized learning with IID rewards

Authors: Rahul Jain, Naumaan Nayyar, Dileep Kalathil

Published in: Allerton Conference

We consider the decentralized multi-armed bandit problem with distinct arms for each player. Each player can pick one arm at each time instant and can get a random reward from an unknown distribution with an unknown mean. The arms give different rew

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::1881b6e5122e2b3ae909cadfea968b86
https://doi.org/10.1109/allerton.2012.6483307

On a restless multi-armed bandit problem with non-identical arms

Authors: Yi Gai, Naumaan Nayyar, Bhaskar Krishnamachari

Published in: Allerton

We consider the following learning problem motivated by opportunistic spectrum access in cognitive radio networks. There are N independent Gilbert-Elliott channels with possibly non-identical transition matrices. It is desired to have an online polic

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::7099826eceefa9256f74855d05bba744
https://doi.org/10.1109/allerton.2011.6120191

Thoracic Epidural Analgesia Does Not Require Prolonged Urinary Catheterization

Authors: Naumaan Nayyar, Rahul Jain, S. Kotova, V. Sah

Published in: Journal of Surgical Research. 186:496

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::8bb5b49510dfbd3fbd9f49be6df0688a
https://doi.org/10.1016/j.jss.2013.11.051
