Extreme Bandits using Robust Statistics

Authors:	Sujay Bhatt, Ping Li, Gennady Samorodnitsky
Publication year:	2021
Subject:	FOS: Computer and information sciences; Computer Science - Machine Learning; Statistics - Machine Learning; Machine Learning (stat.ML); Library and Information Sciences; Computer Science Applications; Information Systems; Machine Learning (cs.LG)
DOI:	10.48550/arxiv.2109.04433
Description:	We consider a multi-armed bandit problem motivated by situations where only the extreme values, as opposed to expected values in the classical bandit setting, are of interest. We propose distribution-free algorithms using robust statistics and characterize their statistical properties. We show that the proposed algorithms achieve vanishing extremal regret under weaker conditions than existing algorithms. The performance of the algorithms in the finite-sample setting is demonstrated through numerical experiments. The results show superior performance of the proposed algorithms compared to well-known algorithms.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4a96513f6dac7ed6eb66edc43d33cee9