Výsledky vyhledávání - "Jacq, Alexis"

Report

On the importance of data collection for training general goal-reaching policies

Autor: Jacq, Alexis, Orsini, Manu, Dulac-Arnold, Gabriel, Pietquin, Olivier, Geist, Matthieu, Bachem, Olivier

Recent advances in ML suggest that the quantity of data available to a model is one of the primary bottlenecks to high performance. Although for language-based tasks there exist almost unlimited amounts of reasonably coherent data to train from, this

Externí odkaz: http://arxiv.org/abs/2211.03521

Zobrazit plný text záznamu

Report

Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act

Autor: Jacq, Alexis, Ferret, Johan, Pietquin, Olivier, Geist, Matthieu

Publikováno v: Autonomous Agents and Multi-Agent Systems (2022)

Traditionally, Reinforcement Learning (RL) aims at deciding how to act optimally for an artificial agent. We argue that deciding when to act is equally important. As humans, we drift from default, instinctive or memorized behaviors to focused, though

Externí odkaz: http://arxiv.org/abs/2203.08542

Zobrazit plný text záznamu

Report

Acme: A Research Framework for Distributed Reinforcement Learning

Deep reinforcement learning (RL) has led to many recent and groundbreaking advances. However, these advances have often come at the cost of both increased scale in the underlying architectures being trained as well as increased complexity of the RL a

Externí odkaz: http://arxiv.org/abs/2006.00979

Zobrazit plný text záznamu

Report

Foolproof Cooperative Learning

Autor: Jacq, Alexis, Perolat, Julien, Geist, Matthieu, Pietquin, Olivier

Publikováno v: Proceedings of The 12th Asian Conference on Machine Learning, PMLR 129:401-416, 2020

This paper extends the notion of learning equilibrium in game theory from matrix games to stochastic games. We introduce Foolproof Cooperative Learning (FCL), an algorithm that converges to a Tit-for-Tat behavior. It allows cooperative strategies whe

Externí odkaz: http://arxiv.org/abs/1906.09831

Zobrazit plný text záznamu

Report

Cognitive Architecture for Mutual Modelling

Autor: Jacq, Alexis, Johal, Wafa, Dillenbourg, Pierre, Paiva, Ana

In social robotics, robots needs to be able to be understood by humans. Especially in collaborative tasks where they have to share mutual knowledge. For instance, in an educative scenario, learners share their knowledge and they must adapt their beha

Externí odkaz: http://arxiv.org/abs/1602.06703

Zobrazit plný text záznamu

Mutual Understanding in Educational Human-Robot Collaborations

Autor: Jacq, Alexis David

Education is an art close to theater. A teacher is taking a role; he works his speeches and his gestures and he plays with the attention of his audience. But it is harder: more than entertaining, a teacher must shape the skills, the knowledge and the

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c048824cdfb95d4d0176546c9ca699a9

Zobrazit plný text záznamu

Expressing Motivations By Facilitating Other's Inverse Reinforcement Learning

Autor: Jacq, Alexis, Johal, Wafa, Paiva, Ana, Dillenbourg, Pierre

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::2bc63e2a0efc26c4aceb6e2040d27928

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání