Zobrazeno 1 - 10
of 7 272
pro vyhledávání: '"Mutti A"'
How can a scientist use a Reinforcement Learning (RL) algorithm to design experiments over a dynamical system's state space? In the case of finite and Markovian systems, an area called Active Exploration (AE) relaxes the optimization problem of exper
Externí odkaz:
http://arxiv.org/abs/2407.13364
The problem of pure exploration in Markov decision processes has been cast as maximizing the entropy over the state distribution induced by the agent's policy, an objective that has been extensively studied. However, little attention has been dedicat
Externí odkaz:
http://arxiv.org/abs/2406.12795
Building on the one-to-one relationship between generalized FGM copulas and multivariate Bernoulli distributions, we prove that the class of multivariate distributions with generalized FGM copulas is a convex polytope. Therefore, we find sharp bounds
Externí odkaz:
http://arxiv.org/abs/2406.10648
In online Inverse Reinforcement Learning (IRL), the learner can collect samples about the dynamics of the environment to improve its estimate of the reward function. Since IRL suffers from identifiability issues, many theoretical works on online IRL
Externí odkaz:
http://arxiv.org/abs/2406.03812
Recent works have studied *state entropy maximization* in reinforcement learning, in which the agent's objective is to learn a policy inducing high entropy over states visitation (Hazan et al., 2019). They typically assume full observability of the s
Externí odkaz:
http://arxiv.org/abs/2406.02295
Autor:
Mutti, Mirco, Tamar, Aviv
Meta reinforcement learning sets a distribution over a set of tasks on which the agent can train at will, then is asked to learn an optimal policy for any test task efficiently. In this paper, we consider a finite set of tasks modeled through Markov
Externí odkaz:
http://arxiv.org/abs/2406.02282
Publikováno v:
Pharmacogenomics and Personalized Medicine, Vol Volume 11, Pp 179-191 (2018)
Concetta Dagostino,1,2 Massimo Allegri,2–4 Valerio Napolioni,5 Simona D’Agnelli,1 Elena Bignami,1 Antonio Mutti,1 Ron HN van Schaik6 1Department of Medicine and Surgery, University of Parma, Parma 43126, Italy; 2Study In Multidisciplinary Pain Re
Externí odkaz:
https://doaj.org/article/ef5c8d30ad4544a8b6b49204e5b797e2
Inverse reinforcement learning (IRL) aims to recover the reward function of an expert agent from demonstrations of behavior. It is well-known that the IRL problem is fundamentally ill-posed, i.e., many reward functions can explain the demonstrations.
Externí odkaz:
http://arxiv.org/abs/2402.15392
The growing deployment of reinforcement learning from human feedback (RLHF) calls for a deeper theoretical investigation of its underlying models. The prevalent models of RLHF do not account for neuroscience-backed, partially-observed "internal state
Externí odkaz:
http://arxiv.org/abs/2402.03282
Autor:
Mutti Anggitta
Publikováno v:
Journal of ASEAN Studies, Vol 4, Iss 2, Pp 178-182 (2017)
The objective of this essay is to discuss the potential future of nuclear security in Southeast Asia by examining the roles of the Association of Southeast Asian Nations (ASEAN) in establishing and maintaining regional cooperation on nuclear security
Externí odkaz:
https://doaj.org/article/4d6e0f84dad14b3f80fe5148855d2367