Výsledky vyhledávání

Report

Video as the New Language for Real-World Decision Making

Autor: Yang, Sherry, Walker, Jacob, Parker-Holder, Jack, Du, Yilun, Bruce, Jake, Barreto, Andre, Abbeel, Pieter, Schuurmans, Dale

Both text and video data are abundant on the internet and support large-scale self-supervised learning through next token or frame prediction. However, they have not been equally leveraged: language models have had significant real-world impact, wher

Externí odkaz: http://arxiv.org/abs/2402.17139

Zobrazit plný text záznamu

Report

Genie: Generative Interactive Environments

We introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual worlds described through text,

Externí odkaz: http://arxiv.org/abs/2402.15391

Zobrazit plný text záznamu

Report

A Generalist Dynamics Model for Control

Autor: Schubert, Ingmar, Zhang, Jingwei, Bruce, Jake, Bechtle, Sarah, Parisotto, Emilio, Riedmiller, Martin, Springenberg, Jost Tobias, Byravan, Arunkumar, Hasenclever, Leonard, Heess, Nicolas

We investigate the use of transformer sequence models as dynamics models (TDMs) for control. We find that TDMs exhibit strong generalization capabilities to unseen environments, both in a few-shot setting, where a generalist TDM is fine-tuned with sm

Externí odkaz: http://arxiv.org/abs/2305.10912

Zobrazit plný text záznamu

Report

Accelerating exploration and representation learning with offline pre-training

Autor: Mazoure, Bogdan, Bruce, Jake, Precup, Doina, Fergus, Rob, Anand, Ankit

Sequential decision-making agents struggle with long horizon tasks, since solving them requires multi-step reasoning. Most reinforcement learning (RL) algorithms address this challenge by improved credit assignment, introducing memory capability, alt

Externí odkaz: http://arxiv.org/abs/2304.00046

Zobrazit plný text záznamu

Report

A Generalist Agent

Publikováno v: Transactions on Machine Learning Research, 11/2022, https://openreview.net/forum?id=1ikK0kHjvj

Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment

Externí odkaz: http://arxiv.org/abs/2205.06175

Zobrazit plný text záznamu

Report

Imitation by Predicting Observations

Autor: Jaegle, Andrew, Sulsky, Yury, Ahuja, Arun, Bruce, Jake, Fergus, Rob, Wayne, Greg

Imitation learning enables agents to reuse and adapt the hard-won expertise of others, offering a solution to several key challenges in learning behavior. Although it is easy to observe behavior in the real-world, the underlying actions may not be ac

Externí odkaz: http://arxiv.org/abs/2107.03851

Zobrazit plný text záznamu

Report

Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

Autor: Dasagi, Vibhavari, Lee, Robert, Bruce, Jake, Leitner, Jürgen

Deep reinforcement learning has been shown to solve challenging tasks where large amounts of training experience is available, usually obtained online while learning the task. Robotics is a significant potential application domain for many of these a

Externí odkaz: http://arxiv.org/abs/1911.08666

Zobrazit plný text záznamu

Report

Ctrl-Z: Recovering from Instability in Reinforcement Learning

Autor: Dasagi, Vibhavari, Bruce, Jake, Peynot, Thierry, Leitner, Jürgen

When learning behavior, training data is often generated by the learner itself; this can result in unstable training dynamics, and this problem has particularly important applications in safety-sensitive real-world control tasks such as robotics. In

Externí odkaz: http://arxiv.org/abs/1910.03732

Zobrazit plný text záznamu

Report

Sim-to-Real Transfer of Robot Learning with Variable Length Inputs

Autor: Dasagi, Vibhavari, Lee, Robert, Mou, Serena, Bruce, Jake, Sünderhauf, Niko, Leitner, Jürgen

Current end-to-end deep Reinforcement Learning (RL) approaches require jointly learning perception, decision-making and low-level control from very sparse reward signals and high-dimensional inputs, with little capability of incorporating prior knowl

Externí odkaz: http://arxiv.org/abs/1809.07480

Zobrazit plný text záznamu

Report

Learning Deployable Navigation Policies at Kilometer Scale from a Single Traversal

Autor: Bruce, Jake, Sünderhauf, Niko, Mirowski, Piotr, Hadsell, Raia, Milford, Michael

Model-free reinforcement learning has recently been shown to be effective at learning navigation policies from complex image input. However, these algorithms tend to require large amounts of interaction with the environment, which can be prohibitivel

Externí odkaz: http://arxiv.org/abs/1807.05211

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání