Multi-armed Bandit with Additional Observations

Autor:	Alexandre Proutiere, Jinwoo Shin, Donggyu Yun, Yung Yi, Sumyeong Ahn
Rok vydání:	2018
Předmět:	Mathematical optimization Computer Networks and Communications Computer science 020206 networking & telecommunications 02 engineering and technology 010501 environmental sciences 01 natural sciences Multi-armed bandit 010104 statistics & probability Hardware and Architecture 0202 electrical engineering electronic engineering information engineering Reinforcement learning 0101 mathematics Online algorithm Software 0105 earth and related environmental sciences
Zdroj:	SIGMETRICS (Abstracts)
DOI:	10.1145/3219617.3219639
Popis:	We study multi-armed bandit (MAB) problems with additional observations, where in each round, the decision maker selects an arm to play and can also observe rewards of additional arms (within a given budget) by paying certain costs. We propose algorithms that are asymptotic-optimal and order-optimal in their regrets under the settings of stochastic and adversarial rewards, respectively.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a9024e79a23372d59ee08642b1b13fa3 https://doi.org/10.1145/3219617.3219639 Zobrazit plný text záznamu