Výsledky vyhledávání - "Bharadwaj, Sudarshanan"

Report

Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks

Autor: Jiang, Yuqian, Bharadwaj, Sudarshanan, Wu, Bo, Shah, Rishi, Topcu, Ufuk, Stone, Peter

In continuing tasks, average-reward reinforcement learning may be a more appropriate problem formulation than the more common discounted reward formulation. As usual, learning an optimal policy in this setting typically requires a large amount of tra

Externí odkaz: http://arxiv.org/abs/2007.01498

Zobrazit plný text záznamu

Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks

Autor: Jiang, Yuqian, Bharadwaj, Sudarshanan, Wu, Bo, Shah, Rishi, Topcu, Ufuk, Stone, Peter

Publikováno v: Proceedings of the AAAI Conference on Artificial Intelligence. 35:7995-8003

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::15e486d63412e86c3077bd0d82c45004
https://doi.org/10.1609/aaai.v35i9.16975

Zobrazit plný text záznamu

Assured decison-making for autonomous systems

Autor: Bharadwaj, Sudarshanan, 0000-0003-3045-8584

As autonomous systems become more widely used in society, they will necessarily have to make more decisions in order to meet increasingly complex objectives. However, to facilitate greater deployment of autonomous systems, especially in safety-critic

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::72d373acefa06cd52e635609a360ddbc

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání