Search results - "Aissi, Mohamed Salim"

Report

Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

Authors: Aissi, Mohamed Salim; Romac, Clement; Carta, Thomas; Lamprier, Sylvain; Oudeyer, Pierre-Yves; Sigaud, Olivier; Soulier, Laure; Thome, Nicolas

Reinforcement learning (RL) is a promising approach for aligning the knowledge of large language models (LLMs) with sequential decision-making tasks. However, few studies have thoroughly investigated the impact on LLM agents' capabilities of fine-tuning them

External link: http://arxiv.org/abs/2410.19920
