Time-varying Markov decision processes with state-action-dependent discount factors and unbounded costs.

Autor:	Escobedo-Trujillo, Beatris A.
Další autoři:	Higuera-Chan, Carmen G.
Jazyk:	angličtina
Předmět:	články journal articles nekonstantní diskontní faktor časově proměnné Markovovy rozhodovací procesy discounted optimality non-constant discount factor time-varying Markov decision processes
Druh dokumentu:	Non-fiction
ISSN:	0023-5954
Abstrakt:	Abstract: In this paper we are concerned with a class of time-varying discounted Markov decision models Mn with unbounded costs cn and state-action dependent discount factors. Specifically we study controlled systems whose state process evolves according to the equation xn+1=Gn(xn, n=0,1,..., with state-action dependent discount factors of the form αn(xn, an), where an and ξn are the control and the random disturbance at time n, respectively. Assuming that the sequences of functions {αn},{cn} and {Gn} converge, in certain sense, to α∞, c∞ and G∞, our objective is to introduce a suitable control model for this class of systems and then, to show the existence of optimal policies for the limit system M∞ corresponding to α∞, c∞ and G∞. Finally, we illustrate our results and their applicability in a class of semi-Markov control models.
Databáze:	Katalog Knihovny AV ČR
Externí odkaz:	Plný text