Authors:
Andersen, Joakim Blach; Zhao, Qingyuan
Sequential decision problems are widely studied across many areas of science. A key challenge when learning policies from historical data - a practice commonly referred to as off-policy learning - is how to "identify" the impact of a policy of inte
External link:
http://arxiv.org/abs/2501.00854