Výsledky vyhledávání - "Kujirai, Toshihiro"

Report

Robust Meta-Reinforcement Learning with Curriculum-Based Task Sampling

Autor: Matsumoto, Morio, Matsuba, Hiroya, Kujirai, Toshihiro

Meta-reinforcement learning (meta-RL) acquires meta-policies that show good performance for tasks in a wide task distribution. However, conventional meta-RL, which learns meta-policies by randomly sampling tasks, has been reported to show meta-overfi

Externí odkaz: http://arxiv.org/abs/2203.16801

Zobrazit plný text záznamu

Dynamic Workforce Scheduling and Routing in a Smart City Using Temporal Batch Decomposition

Autor: Hunabad Tejdeep Reddy, Rishabh Ranjan, Kujirai Toshihiro

Publikováno v: 2021 International Symposium on Computer Science and Intelligent Controls (ISCSIC).

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::9fd13986fcd165881c3a8bf1c208fc2f
https://doi.org/10.1109/iscsic54682.2021.00056

Zobrazit plný text záznamu

スパースな干渉下での強化学習におけるグリーディな行動選択と悲観的なQ値の更新

Autor: Kujirai, Toshihiro

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=jairo_______::8fa9474acc913b39a008eae1c77d0676
http://repository.lib.tottori-u.ac.jp/6544

Zobrazit plný text záznamu

Periodical

Greedy Action Selection and Pessimistic Q-Value Updating in Multi-Agent Reinforcement Learning with Sparse Interaction

Autor: Kujirai, Toshihiro, Yokota, Takayoshi

Publikováno v: SICE Journal of Control, Measurement, and System Integration; May 2019, Vol. 12 Issue: 3 p76-84, 9p

Zobrazit plný text záznamu

Periodical

A spectral-spatial-dynamic hierarchical Bayesian (SSD-HB) model for estimating soybean yield

Autor: Neale, Christopher M. U., Maltese, Antonino, Kazama, Yoriko, Kujirai, Toshihiro

Publikováno v: Proceedings of SPIE; October 2014, Vol. 9239 Issue: 1 p92390X-92390X-7, 831518p

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání