Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Atalar, Baran"'
Autor:
Atalar, Baran, Joe-Wong, Carlee
We consider the contextual combinatorial bandit setting where in each round, the learning agent, e.g., a recommender system, selects a subset of "arms," e.g., products, and observes rewards for both the individual base arms, which are a function of k
Externí odkaz:
http://arxiv.org/abs/2410.14586
In federated multi-armed bandit problems, maximizing global reward while satisfying minimum privacy requirements to protect clients is the main goal. To formulate such problems, we consider a combinatorial contextual bandit setting with groups and ch
Externí odkaz:
http://arxiv.org/abs/2111.14778