Výsledky vyhledávání

Report

Geometric Active Exploration in Markov Decision Processes: the Benefit of Abstraction

Autor: De Santi, Riccardo, Joseph, Federico Arangath, Liniger, Noah, Mutti, Mirco, Krause, Andreas

How can a scientist use a Reinforcement Learning (RL) algorithm to design experiments over a dynamical system's state space? In the case of finite and Markovian systems, an area called Active Exploration (AE) relaxes the optimization problem of exper

Externí odkaz: http://arxiv.org/abs/2407.13364

Zobrazit plný text záznamu

Report

The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough

Autor: Zamboni, Riccardo, Cirino, Duilio, Restelli, Marcello, Mutti, Mirco

The problem of pure exploration in Markov decision processes has been cast as maximizing the entropy over the state distribution induced by the agent's policy, an objective that has been extensively studied. However, little attention has been dedicat

Externí odkaz: http://arxiv.org/abs/2406.12795

Zobrazit plný text záznamu

Report

Generalized FGM dependence: Geometrical representation and convex bounds on sums

Autor: Cossette, Hélène, Marceau, Etienne, Mutti, Alessandro, Semeraro, Patrizia

Building on the one-to-one relationship between generalized FGM copulas and multivariate Bernoulli distributions, we prove that the class of multivariate distributions with generalized FGM copulas is a convex polytope. Therefore, we find sharp bounds

Externí odkaz: http://arxiv.org/abs/2406.10648

Zobrazit plný text záznamu

Report

How does Inverse RL Scale to Large State Spaces? A Provably Efficient Approach

Autor: Lazzati, Filippo, Mutti, Mirco, Metelli, Alberto Maria

In online Inverse Reinforcement Learning (IRL), the learner can collect samples about the dynamics of the environment to improve its estimate of the reward function. Since IRL suffers from identifiability issues, many theoretical works on online IRL

Externí odkaz: http://arxiv.org/abs/2406.03812

Zobrazit plný text záznamu

Report

How to Explore with Belief: State Entropy Maximization in POMDPs

Autor: Zamboni, Riccardo, Cirino, Duilio, Restelli, Marcello, Mutti, Mirco

Recent works have studied *state entropy maximization* in reinforcement learning, in which the agent's objective is to learn a policy inducing high entropy over states visitation (Hazan et al., 2019). They typically assume full observability of the s

Externí odkaz: http://arxiv.org/abs/2406.02295

Zobrazit plný text záznamu

Report

Test-Time Regret Minimization in Meta Reinforcement Learning

Autor: Mutti, Mirco, Tamar, Aviv

Meta reinforcement learning sets a distribution over a set of tasks on which the agent can train at will, then is asked to learn an optimal policy for any test task efficiently. In this paper, we consider a finite set of tasks modeled through Markov

Externí odkaz: http://arxiv.org/abs/2406.02282

Zobrazit plný text záznamu

Report

Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms

Autor: Lazzati, Filippo, Mutti, Mirco, Metelli, Alberto Maria

Inverse reinforcement learning (IRL) aims to recover the reward function of an expert agent from demonstrations of behavior. It is well-known that the IRL problem is fundamentally ill-posed, i.e., many reward functions can explain the demonstrations.

Externí odkaz: http://arxiv.org/abs/2402.15392

Zobrazit plný text záznamu

Report

A Theoretical Framework for Partially Observed Reward-States in RLHF

Autor: Kausik, Chinmaya, Mutti, Mirco, Pacchiano, Aldo, Tewari, Ambuj

The growing deployment of reinforcement learning from human feedback (RLHF) calls for a deeper theoretical investigation of its underlying models. The prevalent models of RLHF do not account for neuroscience-backed, partially-observed "internal state

Externí odkaz: http://arxiv.org/abs/2402.03282

Zobrazit plný text záznamu

Report

Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning

Autor: Mutti, Mirco, De Santi, Riccardo, Restelli, Marcello, Marx, Alexander, Ramponi, Giorgia

Posterior sampling allows exploitation of prior knowledge on the environment's transition dynamics to improve the sample efficiency of reinforcement learning. The prior is typically specified as a class of parametric distributions, the design of whic

Externí odkaz: http://arxiv.org/abs/2310.07518

Zobrazit plný text záznamu

Akademický článek

The catalytic activity of methyltransferase METTL15 is dispensable for its role in mitochondrial ribosome biogenesis

Autor: Christian D. Mutti, Lindsey Van Haute, Michal Minczuk

Publikováno v: RNA Biology, Vol 21, Iss 1, Pp 23-30 (2024)

Ribosomes are large macromolecular complexes composed of both proteins and RNA, that require a plethora of factors and post-transcriptional modifications for their biogenesis. In human mitochondria, the ribosomal RNA is post-transcriptionally modifie

Externí odkaz: https://doaj.org/article/f7670361fb544c68b1ceff6ea35da524

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání