Multi-armed Bandit with Additional Observations

Autor: Alexandre Proutiere, Jinwoo Shin, Donggyu Yun, Yung Yi, Sumyeong Ahn
Rok vydání: 2018
Předmět:
Zdroj: SIGMETRICS (Abstracts)
DOI: 10.1145/3219617.3219639
Popis: We study multi-armed bandit (MAB) problems with additional observations, where in each round, the decision maker selects an arm to play and can also observe rewards of additional arms (within a given budget) by paying certain costs. We propose algorithms that are asymptotic-optimal and order-optimal in their regrets under the settings of stochastic and adversarial rewards, respectively.
Databáze: OpenAIRE