Dynamic programming with meta-reinforcement learning: a novel approach for multi-objective optimization

Autor:	Qi Wang, Chengwei Zhang, Bin Hu
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	Combinatorial optimization Meta-learning Reinforcement learning Dynamic programming Electronic computers. Computer science QA75.5-76.95 Information technology T58.5-58.64
Zdroj:	Complex & Intelligent Systems, Vol 10, Iss 4, Pp 5743-5758 (2024)
Druh dokumentu:	article
ISSN:	2199-4536 2198-6053
DOI:	10.1007/s40747-024-01469-1
Popis:	Abstract Multi-objective optimization (MOO) endeavors to identify optimal solutions from a finite array of possibilities. In recent years, deep reinforcement learning (RL) has exhibited promise through its well-crafted heuristics in tackling NP-hard combinatorial optimization (CO) problems. Nonetheless, current methodologies grapple with two key challenges: (1) They primarily concentrate on single-objective optimization quandaries, rendering them less adaptable to the more prevalent MOO scenarios encountered in real-world applications. (2) These approaches furnish an approximate solution by imbibing heuristics, lacking a systematic means to enhance or substantiate optimality. Given these challenges, this study introduces an overarching hybrid strategy, dynamic programming with meta-reinforcement learning (DPML), to resolve MOO predicaments. The approach melds meta-learning into an RL framework, addressing multiple subproblems inherent to MOO. Furthermore, the precision of solutions is elevated by endowing exact dynamic programming with the prowess of meta-graph neural networks. Empirical results substantiate the supremacy of our methodology over previous RL and heuristics approaches, bridging the chasm between theoretical underpinnings and real-world applicability within this domain.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/7f6240bac7dd4d2b8b02559d4ad9d715 Zobrazit plný text záznamu Full text from SpringerLink View record in DOAJ