Výsledky vyhledávání - "Alegre, Lucas N."

Report

Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization

Autor: Alegre, Lucas N., Bazzan, Ana L. C., Roijers, Diederik M., Nowé, Ann, da Silva, Bruno C.

Multi-objective reinforcement learning (MORL) algorithms tackle sequential decision problems where agents may have different preferences over (possibly conflicting) reward functions. Such algorithms often learn a set of policies (each optimized for a

Externí odkaz: http://arxiv.org/abs/2301.07784

Zobrazit plný text záznamu

Report

Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer

Autor: Alegre, Lucas N., Bazzan, Ana L. C., da Silva, Bruno C.

In many real-world applications, reinforcement learning (RL) agents might have to solve multiple tasks, each one typically modeled via a reward function. If reward functions are expressed linearly, and the agent has previously learned a set of polici

Externí odkaz: http://arxiv.org/abs/2206.11326

Zobrazit plný text záznamu

Report

Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection

Autor: Alegre, Lucas N., Bazzan, Ana L. C., da Silva, Bruno C.

Publikováno v: Proceedings of the 20th International Conference on Autonomous Agents and Multiagent Systems. 2021. 97-105

Non-stationary environments are challenging for reinforcement learning algorithms. If the state transition and/or reward functions change based on latent factors, the agent is effectively tasked with optimizing a behavior that maximizes performance o

Externí odkaz: http://arxiv.org/abs/2105.09452

Zobrazit plný text záznamu

Report

Quantifying the Impact of Non-Stationarity in Reinforcement Learning-Based Traffic Signal Control

Autor: Alegre, Lucas N., Bazzan, Ana L. C., da Silva, Bruno C.

Publikováno v: PeerJ Computer Science 2021

In reinforcement learning (RL), dealing with non-stationarity is a challenging issue. However, some domains such as traffic optimization are inherently non-stationary. Causes for and effects of this are manifold. In particular, when dealing with traf

Externí odkaz: http://arxiv.org/abs/2004.04778

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Kniha

Parameterized Melody Generation with Autoencoders and Temporally-Consistent Noise

Autor: Weber, Aline, Alegre, Lucas N., Tørresen, Jim, Castro da Silva, Bruno

Publikováno v: Weber, Aline Alegre, Lucas N. Tørresen, Jim Castro da Silva, Bruno . Parameterized Melody Generation with Autoencoders and Temporally-Consistent Noise. Music Proceedings of the International Conference on New Interfaces for Musical Expression. 2019, 174-179. Porto Alegre: Universidade Federal do Rio Grande do Sul

Externí odkaz: http://hdl.handle.net/10852/77398
https://www.duo.uio.no/bitstream/handle/10852/77398/2/nime2019_paper035.pdf

Zobrazit plný text záznamu

Akademický článek

Reinforcement learning vs. rule-based adaptive traffic signal control: A Fourier basis linear function approximation for traffic signal control.

Autor: Ziemke, Theresa¹ (AUTHOR) tziemke@vsp.tu-berlin.de, Alegre, Lucas N.² (AUTHOR) lnalegre@inf.ufrgs.br, Bazzan, Ana L.C.² (AUTHOR) bazzan@inf.ufrgs.br, Lujak, Marin (AUTHOR), Dusparic, Ivana (AUTHOR), Klügl, Franziska (AUTHOR), Vizzari, Giuseppe (AUTHOR)

Publikováno v: AI Communications. 2021, Vol. 34 Issue 1, p89-103. 15p.

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání