Výsledky vyhledávání - "Doron, Yotam"

Report

dm_control: Software and Tasks for Continuous Control

Autor: Tassa, Yuval, Tunyasuvunakool, Saran, Muldal, Alistair, Doron, Yotam, Trochim, Piotr, Liu, Siqi, Bohez, Steven, Merel, Josh, Erez, Tom, Lillicrap, Timothy, Heess, Nicolas

The dm_control software package is a collection of Python libraries and task suites for reinforcement learning agents in an articulated-body simulation. A MuJoCo wrapper provides convenient bindings to functions and data structures. The PyMJCF and Co

Externí odkaz: http://arxiv.org/abs/2006.12983

Zobrazit plný text záznamu

Report

Transformation-based Adversarial Video Prediction on Large-Scale Data

Autor: Luc, Pauline, Clark, Aidan, Dieleman, Sander, Casas, Diego de Las, Doron, Yotam, Cassirer, Albin, Simonyan, Karen

Recent breakthroughs in adversarial generative modeling have led to models capable of producing video samples of high quality, even on large and complex datasets of real-world video. In this work, we focus on the task of video prediction, where given

Externí odkaz: http://arxiv.org/abs/2003.04035

Zobrazit plný text záznamu

Report

Behaviour Suite for Reinforcement Learning

Autor: Osband, Ian, Doron, Yotam, Hessel, Matteo, Aslanides, John, Sezener, Eren, Saraiva, Andre, McKinney, Katrina, Lattimore, Tor, Szepesvari, Csaba, Singh, Satinder, Van Roy, Benjamin, Sutton, Richard, Silver, David, Van Hasselt, Hado

This paper introduces the Behaviour Suite for Reinforcement Learning, or bsuite for short. bsuite is a collection of carefully-designed experiments that investigate core capabilities of reinforcement learning (RL) agents with two objectives. First, t

Externí odkaz: http://arxiv.org/abs/1908.03568

Zobrazit plný text záznamu

Report

Deep Reinforcement Learning and the Deadly Triad

Autor: van Hasselt, Hado, Doron, Yotam, Strub, Florian, Hessel, Matteo, Sonnerat, Nicolas, Modayil, Joseph

We know from reinforcement learning theory that temporal difference learning can fail in certain cases. Sutton and Barto (2018) identify a deadly triad of function approximation, bootstrapping, and off-policy learning. When these three properties are

Externí odkaz: http://arxiv.org/abs/1812.02648

Zobrazit plný text záznamu

Report

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Autor: Espeholt, Lasse, Soyer, Hubert, Munos, Remi, Simonyan, Karen, Mnih, Volodymir, Ward, Tom, Doron, Yotam, Firoiu, Vlad, Harley, Tim, Dunning, Iain, Legg, Shane, Kavukcuoglu, Koray

In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and extended training time. We have developed a new distri

Externí odkaz: http://arxiv.org/abs/1802.01561

Zobrazit plný text záznamu

Report

DeepMind Control Suite

Autor: Tassa, Yuval, Doron, Yotam, Muldal, Alistair, Erez, Tom, Li, Yazhe, Casas, Diego de Las, Budden, David, Abdolmaleki, Abbas, Merel, Josh, Lefrancq, Andrew, Lillicrap, Timothy, Riedmiller, Martin

The DeepMind Control Suite is a set of continuous control tasks with a standardised structure and interpretable rewards, intended to serve as performance benchmarks for reinforcement learning agents. The tasks are written in Python and powered by the

Externí odkaz: http://arxiv.org/abs/1801.00690

Zobrazit plný text záznamu

dm_control: Software and tasks for continuous control

Autor: Tunyasuvunakool, Saran, Muldal, Alistair, Doron, Yotam, Liu, Siqi, Bohez, Steven, Merel, Josh, Erez, Tom, Lillicrap, Timothy, Heess, Nicolas, Tassa, Yuval

Publikováno v: In Software Impacts November 2020 6

Zobrazit plný text záznamu

Kniha

User Directed Multi-view-stereo.

Autor: Doron, Yotam, Campbell, Neill D. F., Starck, Jonathan, Kautz, Jan

Publikováno v: Computer Vision - ACCV 2014 Workshops: Singapore, Singapore, November 1-2, 2014, Revised Selected Papers, Part II; 2015, p299-313, 15p

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání