Showing 1 - 3 of 3 for search: '"Gareev, Daniel"'
Text generation, a key component in applications such as dialogue systems, relies on decoding algorithms that sample strings from a language model distribution. Traditional methods, such as top-$k$ and top-$\pi$, apply local normalisation to the mode
External link:
http://arxiv.org/abs/2410.10810
Reinforcement learning (RL) has emerged as a powerful approach for tackling complex problems. The recent introduction of multi-objective reinforcement learning (MORL) has further expanded the scope of RL by enabling agents to make trade-offs among mu
External link:
http://arxiv.org/abs/2310.16487
Published in:
In IFAC PapersOnLine 2022 55(10):3292-3297