Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Ryan J. Lawhead"'
Autor:
Abhijit Gosavi, Ryan J. Lawhead
Publikováno v:
Engineering Applications of Artificial Intelligence. 82:252-262
Reinforcement Learning (RL) is an artificial intelligence technique used to solve Markov and semi-Markov decision processes. Actor critics form a major class of RL algorithms that suffer from a critical deficiency, which is that the values of the so-