Analysis for Adaptability of Policy-Improving System with a Mixture Model of Bayesian Networks to Dynamic Environments.

Autor: Khosla, Rajiv, Howlett, Robert J., Jain, Lakhmi C., Kitakoshi, Daisuke, Shioya, Hiroyuki, Nakano, Ryohei
Zdroj: Knowledge-Based Intelligent Information & Engineering Systems (9783540288978); 2005, p730-737, 8p
Abstrakt: We have proposed an online policy-improving system of reinforcement learning (RL) agents with a mixture model of Bayesian Networks (BNs), and discussed properties of the system. In this paper, two types of mixture models have been applied to the system. A structure of BN in the mixture model is selected based on data collected by agents in an environment, and is regarded as a stochastic knowledge of the environment. This research investigates the adaptability of our system to dynamic environments containing an unexperienced environment, in which an agent does not have the knowledge. [ABSTRACT FROM AUTHOR]
Databáze: Supplemental Index