Authors:
Aissi, Mohamed Salim; Romac, Clement; Carta, Thomas; Lamprier, Sylvain; Oudeyer, Pierre-Yves; Sigaud, Olivier; Soulier, Laure; Thome, Nicolas
Reinforcement learning (RL) is a promising approach for aligning the knowledge of large language models (LLMs) with sequential decision-making tasks. However, few studies have thoroughly investigated the impact on LLM agents' capabilities of fine-tuning them
External link:
http://arxiv.org/abs/2410.19920