Výsledky vyhledávání - "Chittepu, Yaswanth"

Report

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Autor: Rafailov, Rafael, Chittepu, Yaswanth, Park, Ryan, Sikchi, Harshit, Hejna, Joey, Knox, Bradley, Finn, Chelsea, Niekum, Scott

Reinforcement Learning from Human Feedback (RLHF) has been crucial to the recent success of Large Language Models (LLMs), however, it is often a complex and brittle process. In the classical RLHF framework, a reward model is first trained to represen

Externí odkaz: http://arxiv.org/abs/2406.02900

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání