Zobrazeno 1 - 10
of 199
pro vyhledávání: '"Moreno-Bote Rubén"'
A general Markov decision process formalism for action-state entropy-regularized reward maximization
Previous work has separately addressed different forms of action, state and action-state entropy regularization, pure exploration and space occupation. These problems have become extremely relevant for regularization, generalization, speeding up lear
Externí odkaz:
http://arxiv.org/abs/2302.01098
Autor:
Ramírez-Ruiz, Jorge, Grytskyy, Dmytro, Mastrogiuseppe, Chiara, Habib, Yamen, Moreno-Bote, Rubén
Most theories of behavior posit that agents tend to maximize some form of reward or utility. However, animals very often move with curiosity and seem to be motivated in a reward-free manner. Here we abandon the idea of reward maximization, and propos
Externí odkaz:
http://arxiv.org/abs/2205.10316
Publikováno v:
In Current Biology 4 November 2024 34(21):4983-4997
Autor:
Moreno-Bote Rubén, Parga Néstor
Publikováno v:
BMC Neuroscience, Vol 10, Iss Suppl 1, p P237 (2009)
Externí odkaz:
https://doaj.org/article/ae98ad581b3741459145e024432c76ec
Publikováno v:
BMC Neuroscience, Vol 8, Iss Suppl 2, p P78 (2007)
Externí odkaz:
https://doaj.org/article/724e6e8fca2a47aa92c403b87542299f
Autor:
Moreno-Bote Rubén, Parga Néstor
Publikováno v:
BMC Neuroscience, Vol 8, Iss Suppl 2, p P43 (2007)
Externí odkaz:
https://doaj.org/article/86af50213f57436e9d295c9e754cc97a
Many decisions involve choosing an uncertain course of actions in deep and wide decision trees, as when we plan to visit an exotic country for vacation. In these cases, exhaustive search for the best sequence of actions is not tractable due to the la
Externí odkaz:
http://arxiv.org/abs/2104.06339
When facing many options, we narrow down our focus to very few of them. Although behaviors like this can be a sign of heuristics, they can actually be optimal under limited cognitive resources. Here we study the problem of how to optimally allocate l
Externí odkaz:
http://arxiv.org/abs/2102.01597
Publikováno v:
Phys. Rev. E 100, 032132 (2019)
Diffusion processes with boundaries are models of transport phenomena with wide applicability across many fields. These processes are described by their probability density functions (PDFs), which often obey Fokker-Planck equations (FPEs). While obta
Externí odkaz:
http://arxiv.org/abs/1907.03341
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.