Zobrazeno 1 - 10
of 210
pro vyhledávání: '"Benac P"'
Autor:
Bovenzi, Inko, Carmel, Adi, Hu, Michael, Hurwitz, Rebecca M., McBride, Fiona, Benac, Leo, Ayala, José Roberto Tello, Doshi-Velez, Finale
In aims to uncover insights into medical decision-making embedded within observational data from clinical settings, we present a novel application of Inverse Reinforcement Learning (IRL) that identifies suboptimal clinician actions based on the actio
Externí odkaz:
http://arxiv.org/abs/2411.05237
We consider the problem of estimating the transition dynamics $T^*$ from near-optimal expert trajectories in the context of offline model-based reinforcement learning. We develop a novel constraint-based method, Inverse Transition Learning, that trea
Externí odkaz:
http://arxiv.org/abs/2411.05174
Within batch reinforcement learning, safe policy improvement (SPI) seeks to ensure that the learnt policy performs at least as well as the behavior policy that generated the dataset. The core challenge in SPI is seeking improvements while balancing r
Externí odkaz:
http://arxiv.org/abs/2410.09361
Offline Reinforcement learning is commonly used for sequential decision-making in domains such as healthcare and education, where the rewards are known and the transition dynamics $T$ must be estimated on the basis of batch data. A key challenge for
Externí odkaz:
http://arxiv.org/abs/2308.05075
Let ${\mathbf d} =(d_j)_{j\in\mathbb{I}_m}\in \mathbb{N}^m$ be a decreasing finite sequence of positive integers, and let $\alpha=(\alpha_i)_{i\in\mathbb{I}_n}$ be a finite and non-increasing sequence of positive weights. Given a family $\Phi^0=(\mat
Externí odkaz:
http://arxiv.org/abs/2212.12004
Autor:
Thompson, Jordan, Benac, Ryan, Olana, Kidus, Hassan, Talha, Sward, Andrew, Mohd, Tauheed Khan
NFTrig is a web-based application created for use as an educational tool to teach trigonometry and block chain technology. Creation of the application includes front and back end development as well as integration with other outside sources including
Externí odkaz:
http://arxiv.org/abs/2301.00001
Autor:
Benac, Leo, Godin, Frédéric
This paper tackles the risk averse multi-armed bandits problem when incurred losses are non-stationary. The conditional value-at-risk (CVaR) is used as the objective function. Two estimation methods are proposed for this objective function in the pre
Externí odkaz:
http://arxiv.org/abs/2109.13977
Autor:
Dustin D. Benac
Publikováno v:
Religions, Vol 15, Iss 10, p 1154 (2024)
Practical theology has historically engaged in sustained theological reflection on the practices of the Church that intersect with the practices of the world. As field of study, it engages in interdisciplinary engagement that combines social and theo
Externí odkaz:
https://doaj.org/article/b114b1a1319d4bc2b8fbd02cf9df39b2
Publikováno v:
Journal of Marine Science and Engineering, Vol 12, Iss 9, p 1575 (2024)
Prvić Island (Kvarner area in the NE channel part of the Adriatic Sea) is a part of the Natura 2000 protected area network. A recent tombolo is located on the SW coast of Prvić Island, and much larger submerged tombolos are located on the shoal tow
Externí odkaz:
https://doaj.org/article/9e44a0d93c4b49018a6a743efc81ebd8
Autor:
Smolinski, Jason P., Hoogendam, Willem B., Van Kooten, Alex J., Benac, Peyton, Bruce, Zachary J.
Publikováno v:
AJ, 160, 5 (2020)
We seek to resolve the tension in the literature regarding the presence of radially segregated multiple populations in Galactic globular cluster M13. Previous studies of this nearby cluster have presented discordant results about the degree of dynami
Externí odkaz:
http://arxiv.org/abs/2011.08684