Learning reward machines: A study in partially observable reinforcement learning
Autor: | Toro Icarte, Rodrigo, Klassen, Toryn Q., Valenzano, Richard, Castro, Margarita P., Waldie, Ethan, McIlraith, Sheila A. |
---|---|
Zdroj: | In Artificial Intelligence October 2023 323 |
Databáze: | ScienceDirect |
Externí odkaz: |