Gaussian Two-Armed Bandit and Optimization of Batch Data Processing
Autor: | Alexander V. Kolnogorov |
---|---|
Rok vydání: | 2018 |
Předmět: |
Mathematical optimization
Data processing Computer Networks and Communications Gaussian 010102 general mathematics Bayesian probability Process (computing) 020206 networking & telecommunications 02 engineering and technology Minimax 01 natural sciences Computer Science Applications symbols.namesake Prior probability 0202 electrical engineering electronic engineering information engineering symbols A priori and a posteriori 0101 mathematics Game theory Information Systems Mathematics |
Zdroj: | Problems of Information Transmission. 54:84-100 |
ISSN: | 1608-3253 0032-9460 |
DOI: | 10.1134/s0032946018010076 |
Popis: | We consider the minimax setting for the two-armed bandit problem with normally distributed incomes having a priori unknown mathematical expectations and variances. This setting naturally arises in optimization of batch data processing where two alternative processing methods are available with different a priori unknown efficiencies. During the control process, it is required to determine the most efficient method and ensure its predominant application. We use the main theorem of game theory to search for minimax strategy and minimax risk as Bayesian ones corresponding to the worst-case prior distribution. To find them, a recursive integro-difference equation is obtained. We show that batch data processing almost does not increase the minimax risk if the number of batches is large enough. |
Databáze: | OpenAIRE |
Externí odkaz: |