Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Parthasarathi Khirwadkar"'
Autor:
Kumar Ashutosh, Bhishma Dedhia, Shivaram Kalyanakrishnan, Parthasarathi Khirwadkar, Sarthak Consul, Sahil Shah
Publikováno v:
CDC
Policy Iteration (PI) is a classical family of algorithms to compute an optimal policy for any given Markov Decision Problem (MDP). The basic idea in PI is to begin with some initial policy and to repeatedly update the policy to one from an improving