Výsledky vyhledávání - "Aschenbrenner, Leopold"

Report

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Autor: Burns, Collin, Izmailov, Pavel, Kirchner, Jan Hendrik, Baker, Bowen, Gao, Leo, Aschenbrenner, Leopold, Chen, Yining, Ecoffet, Adrien, Joglekar, Manas, Leike, Jan, Sutskever, Ilya, Wu, Jeff

Widely used alignment techniques, such as reinforcement learning from human feedback (RLHF), rely on the ability of humans to supervise model behavior - for example, to evaluate whether a model faithfully followed instructions or generated safe outpu

Externí odkaz: http://arxiv.org/abs/2312.09390

Zobrazit plný text záznamu

Kniha

Der Waldlaubsänger : die neue Brehm-Bücherei, 368 . / Leopold Aschenbrenner.

Autor: Aschenbrenner, Leopold

Vyhledávací nástroje:

Upřesnit hledání