Quantum bandit with amplitude amplification exploration in an adversarial environment

Autor:	Cho, Byungjin, Xiao, Yu, Hui, Pan, Dong, Daoyi
Rok vydání:	2022
Předmět:	Quantum Physics Electrical Engineering and Systems Science - Systems and Control
Druh dokumentu:	Working Paper
Popis:	The rapid proliferation of learning systems in an arbitrarily changing environment mandates the need for managing tensions between exploration and exploitation. This work proposes a quantum-inspired bandit learning approach for the learning-and-adapting-based offloading problem where a client observes and learns the costs of each task offloaded to the candidate resource providers, e.g., fog nodes. In this approach, a new action update strategy and novel probabilistic action selection are adopted, provoked by the amplitude amplification and collapse postulate in quantum computation theory, respectively. We devise a locally linear mapping between a quantum-mechanical phase in a quantum domain, e.g., Grover-type search algorithm, and a distilled probability-magnitude in a value-based decision-making domain, e.g., adversarial multi-armed bandit algorithm. The proposed algorithm is generalized, via the devised mapping, for better learning weight adjustments on favourable/unfavourable actions and its effectiveness is verified via simulation. Comment: Accepted to appear in IEEE TKDE
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/2208.07144 Zobrazit plný text záznamu View this record from Arxiv