Výsledky vyhledávání - "Kang, Sehyeok"

Report

Autor: Kim, Minu, Lee, Yongsik, Kang, Sehyeok, Oh, Jihwan, Chong, Song, Yun, Se-Young

We present Preference Flow Matching (PFM), a new framework for preference-based reinforcement learning (PbRL) that streamlines the integration of preferences into an arbitrary class of pre-trained models. Existing PbRL methods require fine-tuning pre

Externí odkaz: http://arxiv.org/abs/2405.19806

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání