Zobrazeno 1 - 7
of 7
pro vyhledávání: '"Jeffery Dick"'
Autor:
Jeffery Dick, Pawel Ladosz, Eseoghene Ben-Iwhiwhu, Hideyasu Shimadzu, Peter Kinnell, Praveen K. Pilly, Soheil Kolouri, Andrea Soltoggio
Publikováno v:
Frontiers in Neurorobotics, Vol 14 (2020)
The ability of an agent to detect changes in an environment is key to successful adaptation. This ability involves at least two phases: learning a model of an environment, and detecting that a change is likely to have occurred when this model is no l
Externí odkaz:
https://doaj.org/article/2c84bfbbe7d7490ca7cd600ac3209d18
Publikováno v:
Language and Automata Theory and Applications
The Parikh matrix mapping allows us to describe words using matrices. Although compact, this description comes with a level of ambiguity since a single matrix may describe multiple words. This work looks at how considering the Parikh matrices of vari
Autor:
Eseoghene Ben-Iwhiwhu, Pawel Ladosz, Nicholas A. Ketz, Soheil Kolouri, Jeffrey L. Krichmar, Praveen K. Pilly, Jeffery Dick, Andrea Soltoggio
Publikováno v:
IEEE transactions on neural networks and learning systems. 33(5)
In this article, we consider a subclass of partially observable Markov decision process (POMDP) problems which we termed confounding POMDPs. In these types of POMDPs, temporal difference (TD)-based reinforcement learning (RL) algorithms struggle, as
Meta-reinforcement learning (meta-RL) algorithms enable agents to adapt quickly to tasks from few samples in dynamic environments. Such a feat is achieved through dynamic representations in an agent's policy network (obtained via reasoning about task
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e00469d84c57593dfde86817eb4e46ce
Autor:
Hideyasu Shimadzu, Eseoghene Ben-Iwhiwhu, Pawel Ladosz, Andrea Soltoggio, Praveen K. Pilly, Jeffery Dick, Peter Kinnell, Soheil Kolouri
Publikováno v:
Frontiers in Neurorobotics, Vol 14 (2020)
Frontiers in Neurorobotics
Frontiers in Neurorobotics
The ability of an agent to detect changes in an environment is key to successful adaptation. This ability involves at least two phases: learning a model of an environment, and detecting that a change is likely to have occurred when this model is no l
Autor:
Pawel Ladosz, Praveen K. Pilly, Eseoghene Ben-Iwhiwhu, Wen-Hua Chen, Andrea Soltoggio, Jeffery Dick
Publikováno v:
GECCO
Rapid online adaptation to changing tasks is an important problem in machine learning and, recently, a focus of meta-reinforcement learning. However, reinforcement learning (RL) algorithms struggle in POMDP environments because the state of the syste
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::617ee1d8533ef74b353759128e7fe511
http://arxiv.org/abs/2004.12846
http://arxiv.org/abs/2004.12846
Publikováno v:
Language and Automata Theory and Applications ISBN: 9783030406073
LATA
LATA
The Parikh matrix mapping allows us to describe words using matrices. Although compact, this description comes with a level of ambiguity since a single matrix may describe multiple words. This work looks at how considering the Parikh matrices of vari
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::72b602a28394c039e12c3f7aaaa57dd3
https://doi.org/10.1007/978-3-030-40608-0_28
https://doi.org/10.1007/978-3-030-40608-0_28