Zobrazeno 1 - 10
of 1 021
pro vyhledávání: '"A. Benac"'
Autor:
Bovenzi, Inko, Carmel, Adi, Hu, Michael, Hurwitz, Rebecca M., McBride, Fiona, Benac, Leo, Ayala, José Roberto Tello, Doshi-Velez, Finale
In aims to uncover insights into medical decision-making embedded within observational data from clinical settings, we present a novel application of Inverse Reinforcement Learning (IRL) that identifies suboptimal clinician actions based on the actio
Externí odkaz:
http://arxiv.org/abs/2411.05237
We consider the problem of estimating the transition dynamics $T^*$ from near-optimal expert trajectories in the context of offline model-based reinforcement learning. We develop a novel constraint-based method, Inverse Transition Learning, that trea
Externí odkaz:
http://arxiv.org/abs/2411.05174
Within batch reinforcement learning, safe policy improvement (SPI) seeks to ensure that the learnt policy performs at least as well as the behavior policy that generated the dataset. The core challenge in SPI is seeking improvements while balancing r
Externí odkaz:
http://arxiv.org/abs/2410.09361
Offline Reinforcement learning is commonly used for sequential decision-making in domains such as healthcare and education, where the rewards are known and the transition dynamics $T$ must be estimated on the basis of batch data. A key challenge for
Externí odkaz:
http://arxiv.org/abs/2308.05075
Autor:
BÉNAC, KARINE KATIA
Publikováno v:
HYBRIDA, 2024 Jan 01(8), 164-179.
Externí odkaz:
https://www.jstor.org/stable/48778398
Let ${\mathbf d} =(d_j)_{j\in\mathbb{I}_m}\in \mathbb{N}^m$ be a decreasing finite sequence of positive integers, and let $\alpha=(\alpha_i)_{i\in\mathbb{I}_n}$ be a finite and non-increasing sequence of positive weights. Given a family $\Phi^0=(\mat
Externí odkaz:
http://arxiv.org/abs/2212.12004
Autor:
Thompson, Jordan, Benac, Ryan, Olana, Kidus, Hassan, Talha, Sward, Andrew, Mohd, Tauheed Khan
NFTrig is a web-based application created for use as an educational tool to teach trigonometry and block chain technology. Creation of the application includes front and back end development as well as integration with other outside sources including
Externí odkaz:
http://arxiv.org/abs/2301.00001
Autor:
Bueso de Barrio, Luis Eduardo, Fredlund, Lars-Åke, Herranz, Ángel, Mariño, Julio, Benac Earle, Clara
Publikováno v:
In Journal of Logical and Algebraic Methods in Programming January 2025 142
Publikováno v:
In Regional Studies in Marine Science 30 December 2024 80
Publikováno v:
In Journal of Logical and Algebraic Methods in Programming February 2025 143