Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Cai, Yaobang"'
As a data-driven paradigm, offline reinforcement learning (Offline RL) has been formulated as sequence modeling, where the Decision Transformer (DT) has demonstrated exceptional capabilities. Unlike previous reinforcement learning methods that fit va
Externí odkaz:
http://arxiv.org/abs/2409.08062