Markov Strategies in Dynamic Programming

Autor:	van Km Kees Hee
Rok vydání:	1978
Předmět:	Mathematical optimization Markov kernel Markov chain General Mathematics Variable-order Markov model Mathematics::General Topology Markov process Management Science and Operations Research Markov model Computer Science Applications symbols.namesake Mathematics::Probability Markov algorithm symbols Markov property Markov decision process Mathematics
Zdroj:	Mathematics of Operations Research. 3:37-41
ISSN:	1526-5471 0364-765X
DOI:	10.1287/moor.3.1.37
Popis:	It is shown that the supremum of the expected total return over the Markov strategies equals the supremum over all strategies. The model assumptions are: the state space is countable, the action space is measurable and the supremum of the expected total of the positive rewards over the Markov strategies is finite.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::3afcd824dfc110610b2b83d00338df99 https://doi.org/10.1287/moor.3.1.37 Zobrazit plný text záznamu Plný text ve formátu PDF