A Relative Value Iteration Algorithm for Non-degenerate Controlled Diffusions
Autor: | Arapostathis, Ari, Borkar, Vivek S. |
---|---|
Rok vydání: | 2011 |
Předmět: | |
Zdroj: | SIAM J. Control Optim. 50 (2012), no. 4, 1886-1902 |
Druh dokumentu: | Working Paper |
DOI: | 10.1137/110850529 |
Popis: | The ergodic control problem for a non-degenerate controlled diffusion controlled through its drift is considered under a uniform stability condition that ensures the well-posedness of the associated Hamilton-Jacobi-Bellman (HJB) equation. A nonlinear parabolic evolution equation is then proposed as a continuous time continuous state space analog of White's `relative value iteration' algorithm for solving the ergodic dynamic programming equation for the finite state finite action case. Its convergence to the solution of the HJB equation is established using the theory of monotone dynamical systems and also, alternatively, by using the theory of reverse martingales. Comment: 17 pages |
Databáze: | arXiv |
Externí odkaz: |