Zobrazeno 1 - 10
of 16
pro vyhledávání: '"Vincent François-Lavet"'
Publikováno v:
Biomedicines, Vol 9, Iss 2, p 214 (2021)
External beam radiotherapy cancer treatment aims to deliver dose fractions to slowly destroy a tumor while avoiding severe side effects in surrounding healthy tissues. To automate the dose fraction schedules, this paper investigates how deep reinforc
Externí odkaz:
https://doaj.org/article/3dcabffce4f9495689bc09e201ae1012
Publikováno v:
den Hengst, F, François-Lavet, V, Hoogendoorn, M & van Harmelen, F 2022, ' Planning for potential: efficient safe reinforcement learning ', Machine Learning, vol. 111, no. 6, pp. 2255-2274 . https://doi.org/10.1007/s10994-022-06143-6
Machine Learning, 111(6), 2255-2274. Springer Netherlands
Machine Learning, 111(6), 2255-2274. Springer Netherlands
Deep reinforcement learning (DRL) has shown remarkable success in artificial domains and in some real-world applications. However, substantial challenges remain such as learning efficiently under safety constraints. Adherence to safety constraints is
Autor:
Serge Saaybi, Chris J. M. Verhoeven, Tomas van Rietbergen, Vincent François-Lavet, Amjad Yousef Majid, R. Venkatesha Prasad
Deep Reinforcement Learning (DRL) has the potential to surpass human-level control in sequential decision-making problems. Evolution Strategies (ESs) have different characteristics than DRL, yet they are promoted as a scalable alternative. To get ins
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5ae16b16429f192186033d12b3645fdc
https://doi.org/10.36227/techrxiv.14679504
https://doi.org/10.36227/techrxiv.14679504
Publikováno v:
Biomedicines, Vol 9, Iss 214, p 214 (2021)
Biomedicines
Volume 9
Issue 2
Biomedicines, Vol. 9, no.2, p. 214 (2021)
Biomedicines
Volume 9
Issue 2
Biomedicines, Vol. 9, no.2, p. 214 (2021)
External beam radiotherapy cancer treatment aims to deliver dose fractions to slowly destroy a tumor while avoiding severe side effects in surrounding healthy tissues. To automate the dose fraction schedules, this paper investigates how deep reinforc
Publikováno v:
IJCAI
By virtue of their expressive power, neural networks (NNs) are well suited to fitting large, complex datasets, yet they are also known to produce similar predictions for points outside the training distribution. As such, they are, like humans, under
Publikováno v:
IJCAI
When an agent has limited information on its environment, the suboptimality of an RL algorithm can be decomposed into the sum of two terms: a term related to an asymptotic bias (suboptimality with unlimited data) and a term due to overfitting (additi
Publikováno v:
Foundations and Trends® in Machine Learning. 11:219-354
Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. This field of research has been able to solve a wide range of complex decision-making tasks that were previously out of reach for a machine. Thus, deep R
Publikováno v:
AAAI
In the quest for efficient and robust reinforcement learning methods, both model-free and model-based approaches offer advantages. In this paper we propose a new way of explicitly bridging both approaches via a shared low-dimensional learned encoding
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ba07fc0fe68863bff3c12bdace2e8ee6
http://arxiv.org/abs/1809.04506
http://arxiv.org/abs/1809.04506
Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. This field of research has recently been able to solve a wide range of complex decision-making tasks that were previously out of reach for a machine. Dee