Výsledky vyhledávání

Report

Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

Autor: Aissi, Mohamed Salim, Romac, Clement, Carta, Thomas, Lamprier, Sylvain, Oudeyer, Pierre-Yves, Sigaud, Olivier, Soulier, Laure, Thome, Nicolas

Reinforcement learning (RL) is a promising approach for aligning large language models (LLMs) knowledge with sequential decision-making tasks. However, few studies have thoroughly investigated the impact on LLM agents capabilities of fine-tuning them

Externí odkaz: http://arxiv.org/abs/2410.19920

Zobrazit plný text záznamu

Report

SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling

Autor: Gaven, Loris, Romac, Clement, Carta, Thomas, Lamprier, Sylvain, Sigaud, Olivier, Oudeyer, Pierre-Yves

The past years have seen Large Language Models (LLMs) strive not only as generative models but also as agents solving textual sequential decision-making tasks. When facing complex environments where their zero-shot abilities are insufficient, recent

Externí odkaz: http://arxiv.org/abs/2410.12481

Zobrazit plný text záznamu

Report

From Goal-Conditioned to Language-Conditioned Agents via Vision-Language Models

Autor: Cachet, Theo, Dance, Christopher R., Sigaud, Olivier

Vision-language models (VLMs) have tremendous potential for grounding language, and thus enabling language-conditioned agents (LCAs) to perform diverse tasks specified with text. This has motivated the study of LCAs based on reinforcement learning (R

Externí odkaz: http://arxiv.org/abs/2409.16024

Zobrazit plný text záznamu

Report

The Hidden Subsidy of the Affordable Care Act

Autor: Sigaud, Liam, Bjoerkheim, Markus, Melo, Vitor

Under the ACA, the federal government paid a substantially larger share of medical costs of newly eligible Medicaid enrollees than previously eligible ones. States could save up to 100% of their per-enrollee costs by reclassifying original enrollees

Externí odkaz: http://arxiv.org/abs/2407.07217

Zobrazit plný text záznamu

Report

Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning

Autor: Asri, Zakariae El, Sigaud, Olivier, Thome, Nicolas

Applying reinforcement learning (RL) to real-world applications requires addressing a trade-off between asymptotic performance, sample efficiency, and inference time. In this work, we demonstrate how to address this triple challenge by leveraging par

Externí odkaz: http://arxiv.org/abs/2407.02217

Zobrazit plný text záznamu

Report

Rural Healthcare Access and Supply Constraints: A Causal Analysis

Autor: Melo, Vitor, Sigaud, Liam, Neilson, Elijah, Bjoerkheim, Markus

Certificate-of-need (CON) laws require that healthcare providers receive approval from a state board before offering additional services in a given community. Proponents of CON laws claim that these laws are needed to prevent the oversupply of health

Externí odkaz: http://arxiv.org/abs/2405.08168

Zobrazit plný text záznamu

Report

Single-Reset Divide & Conquer Imitation Learning

Autor: Chenu, Alexandre, Serris, Olivier, Sigaud, Olivier, Perrin-Gilbert, Nicolas

Demonstrations are commonly used to speed up the learning process of Deep Reinforcement Learning algorithms. To cope with the difficulty of accessing multiple demonstrations, some algorithms have been developed to learn from a single demonstration. I

Externí odkaz: http://arxiv.org/abs/2402.09355

Zobrazit plný text záznamu

Report

A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

Autor: Sigaud, Olivier, Baldassarre, Gianluca, Colas, Cedric, Doncieux, Stephane, Duro, Richard, Oudeyer, Pierre-Yves, Perrin-Gilbert, Nicolas, Santucci, Vieri Giuliano

A lot of recent machine learning research papers have ``open-ended learning'' in their title. But very few of them attempt to define what they mean when using the term. Even worse, when looking more closely there seems to be no consensus on what dist

Externí odkaz: http://arxiv.org/abs/2311.00344

Zobrazit plný text záznamu

Report

An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks

Autor: Raffin, Antonin, Sigaud, Olivier, Kober, Jens, Albu-Schäffer, Alin, Silvério, João, Stulp, Freek

In search of a simple baseline for Deep Reinforcement Learning in locomotion tasks, we propose a model-free open-loop strategy. By leveraging prior knowledge and the elegance of simple oscillators to generate periodic joint motions, it achieves respe

Externí odkaz: http://arxiv.org/abs/2310.05808

Zobrazit plný text záznamu

Akademický článek

Bleeding management in type 3 von Willebrand disease with anti‐von Willebrand factor inhibitor: A literature review and case report

Autor: Aurélie Briane, Valérie Horvais, Marianne Sigaud, Marc Trossaërt, Nicolas Drillaud, Catherine Ternisien, Marc Fouassier, Antoine Babuty

Publikováno v: eJHaem, Vol 5, Iss 5, Pp 964-970 (2024)

Abstract Treatment of type 3 von Willebrand disease by infusion of von Willebrand factor (VWF) and factor VIII (FVIII) concentrates may lead to the development of anti‐VWF antibodies, challenging haemostasis management. The systematic review of the

Externí odkaz: https://doaj.org/article/d6a73bb4dec94851b6773375cd734b50

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání