Showing 1 - 8 of 8 for search: '"Doron, Yotam"'
Author:
Tassa, Yuval, Tunyasuvunakool, Saran, Muldal, Alistair, Doron, Yotam, Trochim, Piotr, Liu, Siqi, Bohez, Steven, Merel, Josh, Erez, Tom, Lillicrap, Timothy, Heess, Nicolas
The dm_control software package is a collection of Python libraries and task suites for reinforcement learning agents in an articulated-body simulation. A MuJoCo wrapper provides convenient bindings to functions and data structures. The PyMJCF and Co
External link:
http://arxiv.org/abs/2006.12983
Author:
Luc, Pauline, Clark, Aidan, Dieleman, Sander, Casas, Diego de Las, Doron, Yotam, Cassirer, Albin, Simonyan, Karen
Recent breakthroughs in adversarial generative modeling have led to models capable of producing video samples of high quality, even on large and complex datasets of real-world video. In this work, we focus on the task of video prediction, where given
External link:
http://arxiv.org/abs/2003.04035
Author:
Osband, Ian, Doron, Yotam, Hessel, Matteo, Aslanides, John, Sezener, Eren, Saraiva, Andre, McKinney, Katrina, Lattimore, Tor, Szepesvari, Csaba, Singh, Satinder, Van Roy, Benjamin, Sutton, Richard, Silver, David, Van Hasselt, Hado
This paper introduces the Behaviour Suite for Reinforcement Learning, or bsuite for short. bsuite is a collection of carefully-designed experiments that investigate core capabilities of reinforcement learning (RL) agents with two objectives. First, t
External link:
http://arxiv.org/abs/1908.03568
Author:
van Hasselt, Hado, Doron, Yotam, Strub, Florian, Hessel, Matteo, Sonnerat, Nicolas, Modayil, Joseph
We know from reinforcement learning theory that temporal difference learning can fail in certain cases. Sutton and Barto (2018) identify a deadly triad of function approximation, bootstrapping, and off-policy learning. When these three properties are
External link:
http://arxiv.org/abs/1812.02648
Author:
Espeholt, Lasse, Soyer, Hubert, Munos, Remi, Simonyan, Karen, Mnih, Volodymir, Ward, Tom, Doron, Yotam, Firoiu, Vlad, Harley, Tim, Dunning, Iain, Legg, Shane, Kavukcuoglu, Koray
In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and extended training time. We have developed a new distri
External link:
http://arxiv.org/abs/1802.01561
Author:
Tassa, Yuval, Doron, Yotam, Muldal, Alistair, Erez, Tom, Li, Yazhe, Casas, Diego de Las, Budden, David, Abdolmaleki, Abbas, Merel, Josh, Lefrancq, Andrew, Lillicrap, Timothy, Riedmiller, Martin
The DeepMind Control Suite is a set of continuous control tasks with a standardised structure and interpretable rewards, intended to serve as performance benchmarks for reinforcement learning agents. The tasks are written in Python and powered by the
External link:
http://arxiv.org/abs/1801.00690
Author:
Tunyasuvunakool, Saran, Muldal, Alistair, Doron, Yotam, Liu, Siqi, Bohez, Steven, Merel, Josh, Erez, Tom, Lillicrap, Timothy, Heess, Nicolas, Tassa, Yuval
Published in:
In Software Impacts, November 2020, 6
Published in:
Computer Vision - ACCV 2014 Workshops: Singapore, Singapore, November 1-2, 2014, Revised Selected Papers, Part II; 2015, p299-313, 15p