Showing 1 - 10 of 60 for search: '"Sudarsanam, Nandan"'
Robust Policy Search is the problem of learning policies that do not degrade in performance when subject to unseen environment model parameters. It is particularly relevant for transferring policies learned in a simulation environment to the real wor
External link:
http://arxiv.org/abs/1901.00117
Published in:
In Transportation Research Interdisciplinary Perspectives December 2022 16
The use of Association Rule Mining techniques in diverse contexts and domains has resulted in the creation of numerous interestingness measures. This, in turn, has motivated researchers to come up with various classification schemes for these measure
External link:
http://arxiv.org/abs/1712.05193
Published in:
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, Louisiana, USA, February 2-7, 2018
We propose a novel variant of the UCB algorithm (referred to as Efficient-UCB-Variance (EUCBV)) for minimizing cumulative regret in the stochastic multi-armed bandit (MAB) setting. EUCBV incorporates the arm elimination strategy proposed in UCB-Impro
External link:
http://arxiv.org/abs/1711.03591
Published in:
Proceedings of the 26th International Joint Conference on Artificial Intelligence, 2017, 2515-2521
In this paper we propose the Augmented-UCB (AugUCB) algorithm for a fixed-budget version of the thresholding bandit problem (TBP), where the objective is to identify a set of arms whose quality is above a threshold. A key feature of AugUCB is that it
External link:
http://arxiv.org/abs/1704.02281
Published in:
Crime Science; 8/14/2024, Vol. 13 Issue 1, p1-15, 15p
This study presents two new algorithms for solving linear stochastic bandit problems. The proposed methods use an approach from non-parametric statistics called bootstrapping to create confidence bounds. This is achieved without making any assumption
External link:
http://arxiv.org/abs/1605.01185
Author:
Sudarsanam, Nandan, 1981
Thesis (Ph. D.)--Massachusetts Institute of Technology, Engineering Systems Division, 2008.
Cataloged from PDF version of thesis.
Includes bibliographical references (p. 81-86).
This thesis recommends an experimentation methodology whi
External link:
http://hdl.handle.net/1721.1/53211
Recent advancements in revenue management of taxi services: a systematic review and research agenda.
Published in:
Management Review Quarterly; Jun2024, Vol. 74 Issue 2, p1029-1055, 27p
Author:
Sudarsanam, Nandan
Thesis (M. S.)--Oklahoma State University, 2005.
Vita. Includes bibliographical references (p. 63-66).
External link:
http://digital.library.okstate.edu/etd/umi-okstate-1356.pdf