Expert knowledge data-driven based actor–critic reinforcement learning framework to solve computationally expensive unit commitment problems with uncertain wind energy

Autor:	Huijun Liang, Chenhao Lin, Aokang Pang
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	Unit commitment Reinforcement learning Knowledge data-driven Surrogate model Meta-heuristic algorithm Production of electric energy or power. Powerplants. Central stations TK1001-1841
Zdroj:	International Journal of Electrical Power & Energy Systems, Vol 159, Iss , Pp 110033- (2024)
Druh dokumentu:	article
ISSN:	0142-0615
DOI:	10.1016/j.ijepes.2024.110033
Popis:	With the expansion of power grid, unaffordable computational cost and time will pose serious challenges of time-efficient scheduling in unit commitment problem (UCP). However, existing optimization methods, i.e., mathematical programming methods and meta-heuristic algorithms, are powerless and time-consuming to handle computationally expensive UCP (CEUCP). Thus, reinforcement learning methods with strong inference and time-saving performances are motivated to solve the computationally expensive challenges in tackling CEUCPs. In this paper, a novel expert knowledge data-driven based actor–critic (AC) reinforcement learning methodology is proposed for solving CEUCPs. Specifically, in the proposed AC reinforcement learning methodology, expert knowledge, data-driven surrogate model, and improved meta-heuristic algorithm are integrated for further performance enhancement. Firstly, a novel action selection mechanism (based on the expert knowledge of thermal units characteristic) is integrated into AC to improve the efficiency of network training. Secondly, an improved extreme learning machine (ELM) data-driven surrogate model is proposed to build reward function in AC. In detail, original cost function in reward is replaced by a lightweight ELM model. Shape distance is integrated into ELM for enhancing accuracy. Finally, original marine predators algorithm (MPA) is improved for obtaining optimal dispatching decisions and rewards of AC method quickly and correctly. Original search pattern is replaced by quantum based representation for boosting convergence. The excellent performances of the proposed AC framework are verified by simulations of 10-units, 100-units, and 100-units with wind energy test systems.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/34473a8faa734d6c8e8d08677bfc3277 Zobrazit plný text záznamu Full Text from ScienceDirect View record in DOAJ