Suggestion of probabilistic reward-independent knowledge for dynamic environment in reinforcement learning.
Authors: | Shibuya, N., Miyazaki, Y., Kurashige, K. |
---|---|
Source: | 2011 International Symposium on Micro-NanoMechatronics & Human Science (MHS); 2011, pp. 140-145 (6 pages) |
Database: | Complementary Index |