Relationship Explainable Multi-objective Optimization Via Vector Value Function Based Reinforcement Learning

Author:	Zhan, Huixin; Cao, Yongcan
Year of publication:	2019
Subject:	Electrical Engineering and Systems Science - Systems and Control; Computer Science - Machine Learning; Mathematics - Optimization and Control
Document type:	Working Paper
Description:	Solving multi-objective optimization problems is important in various applications where users are interested in obtaining optimal policies subject to multiple, often conflicting, objectives. A typical approach to obtaining optimal policies is to first construct a loss function based on the scalarization of the individual objectives, and then find the policy that minimizes this loss. However, optimizing the scalarized (and weighted) loss does not necessarily guarantee high performance on each of the possibly conflicting objectives. In this paper, we propose a vector value based reinforcement learning approach that seeks to explicitly learn the inter-objective relationship and optimize multiple objectives based on the learned relationship. In particular, the proposed method first defines a relationship matrix, a mathematical representation of the inter-objective relationship, and then creates one actor and multiple critics that co-learn the relationship matrix and action selection. The proposed approach can quantify the inter-objective relationship via reinforcement learning when the impact of one objective on another is unknown a priori. We also provide a rigorous convergence analysis of the proposed approach and present a quantitative evaluation of the approach based on two testing scenarios. Comment: COLT19 submission. arXiv admin note: substantial text overlap with arXiv:1909.12268
Database:	arXiv
External link:	http://arxiv.org/abs/1910.01919