Zobrazeno 1 - 10
of 26
pro vyhledávání: '"Ramírez‐Ruiz, Jorge"'
A general Markov decision process formalism for action-state entropy-regularized reward maximization
Previous work has separately addressed different forms of action, state and action-state entropy regularization, pure exploration and space occupation. These problems have become extremely relevant for regularization, generalization, speeding up lear
Externí odkaz:
http://arxiv.org/abs/2302.01098
Autor:
Ramírez-Ruiz, Jorge, Grytskyy, Dmytro, Mastrogiuseppe, Chiara, Habib, Yamen, Moreno-Bote, Rubén
Most theories of behavior posit that agents tend to maximize some form of reward or utility. However, animals very often move with curiosity and seem to be motivated in a reward-free manner. Here we abandon the idea of reward maximization, and propos
Externí odkaz:
http://arxiv.org/abs/2205.10316
When facing many options, we narrow down our focus to very few of them. Although behaviors like this can be a sign of heuristics, they can actually be optimal under limited cognitive resources. Here we study the problem of how to optimally allocate l
Externí odkaz:
http://arxiv.org/abs/2102.01597
Publikováno v:
Phys. Rev. B 96, 235201 (2017)
Recent theoretical work has established the presence of hidden spin and orbital textures in non-magnetic materials with inversion symmetry. Here, we propose that these textures can be detected by nuclear magnetic resonance (NMR) measurements carried
Externí odkaz:
http://arxiv.org/abs/1709.02376
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
Proceedings of the National Academy of Sciences of the United States of America, 2020 Aug . 117(33), 19799-19808.
Externí odkaz:
https://www.jstor.org/stable/26968574
Publikováno v:
Phys. Rev. B 94, 115204 (2016)
Motivated by recent nuclear magnetic resonance (NMR) experiments, we present a microscopic sp3 tight-binding model calculation of the NMR shifts in bulk Bi2Se3, and Bi2Te3. We compute the contact, dipolar, orbital and core polarization contributions
Externí odkaz:
http://arxiv.org/abs/1602.02649
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Intrinsic motivation generates behaviors that do not necessarily lead to immediate reward, but help exploration and learning. Here we show that agents having the sole goal of maximizing occupancy of future actions and states, that is, moving and expl
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0129313e51a249600118019cff2fc458
Publikováno v:
Physical Review B. Sep2016, Vol. 94 Issue 11, p1-1. 1p.