Iterative Model Refinement of Recommender MDPs Based on Expert Feedback

Autor: Pascal Poupart, Omar Zia Khan, John Mark Agosta
Rok vydání: 2013
Předmět:
Zdroj: Advanced Information Systems Engineering ISBN: 9783642387081
ECML/PKDD (1)
DOI: 10.1007/978-3-642-40988-2_11
Popis: In this paper, we present a method to iteratively refine the parameters of a Markov Decision Process by leveraging constraints implied from an expert's review of the policy. We impose a constraint on the parameters of the model for every case where the expert's recommendation differs from the recommendation of the policy. We demonstrate that consistency with an expert's feedback leads to non-convex constraints on the model parameters. We refine the parameters of the model, under these constraints, by partitioning the parameter space and iteratively applying alternating optimization. We demonstrate how the approach can be applied to both flat and factored MDPs and present results based on diagnostic sessions from a manufacturing scenario.
Databáze: OpenAIRE