Search results

Report

Intra-agent speech permits zero-shot task acquisition

Authors: Yan, Chen; Carnevale, Federico; Georgiev, Petko; Santoro, Adam; Guy, Aurelia; Muldal, Alistair; Hung, Chia-Chun; Abramson, Josh; Lillicrap, Timothy; Wayne, Gregory

Human language learners are exposed to a trickle of informative, context-sensitive language, but a flood of raw sensory data. Through both social language use and internal processes of rehearsal and practice, language learners are able to build high-

External link: http://arxiv.org/abs/2206.03139

Report

Evaluating Multimodal Interactive Agents

Authors: Abramson, Josh; Ahuja, Arun; Carnevale, Federico; Georgiev, Petko; Goldin, Alex; Hung, Alden; Landon, Jessica; Lillicrap, Timothy; Muldal, Alistair; Richards, Blake; Santoro, Adam; von Glehn, Tamara; Wayne, Greg; Wong, Nathaniel; Yan, Chen

Creating agents that can interact naturally with humans is a common goal in artificial intelligence (AI) research. However, evaluating these interactions is challenging: collecting online human-agent interactions is slow and expensive, yet faster pro

External link: http://arxiv.org/abs/2205.13274

Report

A data-driven approach for learning to control computers

Authors: Humphreys, Peter C; Raposo, David; Pohlen, Toby; Thornton, Gregory; Chhaparia, Rachita; Muldal, Alistair; Abramson, Josh; Georgiev, Petko; Goldin, Alex; Santoro, Adam; Lillicrap, Timothy

Published in: Proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

It would be useful for machines to use computers as humans do so that they can aid us in everyday tasks. This is a setting in which there is also the potential to leverage large-scale expert demonstrations and human judgements of interactive behaviou

External link: http://arxiv.org/abs/2202.08137

Report

Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning

A common vision from science fiction is that robots will one day inhabit our physical spaces, sense the world as we do, assist our physical labours, and communicate with us through natural language. Here we study how to design artificial agents that

External link: http://arxiv.org/abs/2112.03763

Report

Imitating Interactive Intelligence

External link: http://arxiv.org/abs/2012.05672

Report

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

Authors: Mirza, Mehdi; Jaegle, Andrew; Hunt, Jonathan J.; Guez, Arthur; Tunyasuvunakool, Saran; Muldal, Alistair; Weber, Théophane; Karkus, Peter; Racanière, Sébastien; Buesing, Lars; Lillicrap, Timothy; Heess, Nicolas

Recent work in deep reinforcement learning (RL) has produced algorithms capable of mastering challenging games such as Go, chess, or shogi. In these works the RL agent directly observes the natural state of the game and controls that state directly w

External link: http://arxiv.org/abs/2009.05524

Report

dm_control: Software and Tasks for Continuous Control

Authors: Tassa, Yuval; Tunyasuvunakool, Saran; Muldal, Alistair; Doron, Yotam; Trochim, Piotr; Liu, Siqi; Bohez, Steven; Merel, Josh; Erez, Tom; Lillicrap, Timothy; Heess, Nicolas

The dm_control software package is a collection of Python libraries and task suites for reinforcement learning agents in an articulated-body simulation. A MuJoCo wrapper provides convenient bindings to functions and data structures. The PyMJCF and Co

External link: http://arxiv.org/abs/2006.12983

Report

Distributed Distributional Deterministic Policy Gradients

Authors: Barth-Maron, Gabriel; Hoffman, Matthew W.; Budden, David; Dabney, Will; Horgan, Dan; TB, Dhruva; Muldal, Alistair; Heess, Nicolas; Lillicrap, Timothy

This work adopts the very successful distributional perspective on reinforcement learning and adapts it to the continuous control setting. We combine this within a distributed framework for off-policy learning in order to develop what we call the Dis

External link: http://arxiv.org/abs/1804.08617

Report

Learning Awareness Models

Authors: Amos, Brandon; Dinh, Laurent; Cabi, Serkan; Rothörl, Thomas; Colmenarejo, Sergio Gómez; Muldal, Alistair; Erez, Tom; Tassa, Yuval; de Freitas, Nando; Denil, Misha

We consider the setting of an agent with a fixed body interacting with an unknown and uncertain external world. We show that models trained to predict proprioceptive information about the agent's body come to represent objects in the external world.

External link: http://arxiv.org/abs/1804.06318
