Showing 1 - 3 of 3 for search: '"Pan, Haoxuan"'
This paper considers a novel co-design problem of the optimal \textit{sequential} attack, whose attack strategy changes with the time series, and in which the \textit{sequential} attack selection strategy and \textit{sequential} attack signal are sim
External link:
http://arxiv.org/abs/2311.09933
We revisit the estimation bias in policy gradients for the discounted episodic Markov decision process (MDP) from Deep Reinforcement Learning (DRL) perspective. The objective is formulated theoretically as the expected returns discounted over the tim
External link:
http://arxiv.org/abs/2301.08442
Author:
Liu, Zhengkun; Wang, Qianqian; Li, Mohui; Ai, Yujie; Pan, Haoxuan; Li, Pengtao; Lang, Hao; Li, Kunyu; Dong, Shouliang
Published in:
In Dyes and Pigments, September 2020, 180