Výsledky vyhledávání - "Matej Vecerik"

Deep Q-learning from Demonstrations

Autor: Todd Hester, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Ian Osband, Gabriel Dulac-Arnold, John Agapiou, Joel Leibo, Audrunas Gruslys

Deep reinforcement learning (RL) has achieved several high profile successes in difficult decision-making problems. However, these algorithms typically require a huge amount of data before they reach reasonable performance. In fact, their performance

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d2603999dd3fc6b0bb5f7dadefd67bf1
http://arxiv.org/abs/1704.03732

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání