Výsledky vyhledávání - "Ramos, João A. Cândido"

Report

Mimicking Better by Matching the Approximate Action Distribution

Autor: Ramos, João A. Cândido, Blondé, Lionel, Takeishi, Naoya, Kalousis, Alexandros

In this paper, we introduce MAAD, a novel, sample-efficient on-policy algorithm for Imitation Learning from Observations. MAAD utilizes a surrogate reward signal, which can be derived from various sources such as adversarial games, trajectory matchin

Externí odkaz: http://arxiv.org/abs/2306.09805

Zobrazit plný text záznamu

Report

Conditional Neural Relational Inference for Interacting Systems

Autor: Ramos, Joao A. Candido, Blondé, Lionel, Armand, Stéphane, Kalousis, Alexandros

In this work, we want to learn to model the dynamics of similar yet distinct groups of interacting objects. These groups follow some common physical laws that exhibit specificities that are captured through some vectorial description. We develop a mo

Externí odkaz: http://arxiv.org/abs/2106.11083

Zobrazit plný text záznamu

Sample-Efficient On-Policy Imitation Learning from Observations

Autor: Ramos, João A. Cândido, Blondé, Lionel, Takeishi, Naoya, Kalousis, Alexandros

Imitation learning from demonstrations (ILD) aims to alleviate numerous shortcomings of reinforcement learning through the use of demonstrations. However, in most real-world applications, expert action guidance is absent, making the use of ILD imposs

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::97fab3b6c00347c2bcc39889b20cc7f9
http://arxiv.org/abs/2306.09805

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání