Showing 1 - 10 of 71 for search: '"Veness, Joel"'
Foundation models have recently been shown to be strong data compressors. However, when accounting for their excessive parameter count, their compression ratios are actually inferior to standard compression algorithms. Moreover, naively reducing the…
External link:
http://arxiv.org/abs/2410.05078
Author:
Grau-Moya, Jordi, Genewein, Tim, Hutter, Marcus, Orseau, Laurent, Delétang, Grégoire, Catt, Elliot, Ruoss, Anian, Wenliang, Li Kevin, Mattern, Christopher, Aitchison, Matthew, Veness, Joel
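The first entry above argues that a model's own size must be counted against its compression gains. A minimal sketch of that accounting, under the assumption that the adjusted ratio simply adds the parameter bytes to the compressed output (the paper's exact metric may differ, and the numbers below are illustrative, not from the paper):

```python
def adjusted_compression_ratio(raw_bytes, compressed_bytes, model_param_bytes):
    """Compression ratio once the model's own size is counted as overhead.

    A large predictive model can shrink the data a lot, but if the decoder
    needs the model's parameters to reconstruct the data, those bytes must
    be paid for too.
    """
    return (compressed_bytes + model_param_bytes) / raw_bytes

# Hypothetical numbers: a 1 GB corpus compressed to 100 MB by a model
# whose parameters occupy 400 MB, versus a standard compressor reaching
# 300 MB with negligible model size.
model_ratio = adjusted_compression_ratio(1_000_000_000, 100_000_000, 400_000_000)
std_ratio = adjusted_compression_ratio(1_000_000_000, 300_000_000, 0)
print(model_ratio)  # 0.5
print(std_ratio)    # 0.3
```

Under this accounting the model "wins" on raw output size (100 MB vs 300 MB) yet loses once its parameters are amortized, which is the tension the abstract describes.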
Meta-learning has emerged as a powerful approach to train neural networks to learn new tasks quickly from limited data. Broad exposure to different tasks leads to versatile representations enabling general problem solving. But, what are the limits of…
External link:
http://arxiv.org/abs/2401.14953
Author:
Delétang, Grégoire, Ruoss, Anian, Duquenne, Paul-Ambroise, Catt, Elliot, Genewein, Tim, Mattern, Christopher, Grau-Moya, Jordi, Wenliang, Li Kevin, Aitchison, Matthew, Orseau, Laurent, Hutter, Marcus, Veness, Joel
It has long been established that predictive models can be transformed into lossless compressors and vice versa. Incidentally, in recent years, the machine learning community has focused on training increasingly large and powerful self-supervised…
External link:
http://arxiv.org/abs/2309.10668
Author:
Ruoss, Anian, Delétang, Grégoire, Genewein, Tim, Grau-Moya, Jordi, Csordás, Róbert, Bennani, Mehdi, Legg, Shane, Veness, Joel
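The prediction-compression equivalence referenced in the entry above (arXiv:2309.10668) rests on a standard fact: an ideal entropy coder driven by a sequential predictor spends -log2 p(symbol | history) bits per symbol, so total code length equals the predictor's cumulative log loss. A minimal sketch; the uniform "model" below is a stand-in for illustration, not the paper's method:

```python
import math

def code_length_bits(sequence, predict):
    """Total ideal code length in bits under a sequential predictor.

    `predict(history)` returns a dict mapping each possible next symbol
    to its probability given the history seen so far. Each symbol then
    costs -log2 of the probability the predictor assigned to it.
    """
    total = 0.0
    for i, symbol in enumerate(sequence):
        probs = predict(sequence[:i])
        total += -math.log2(probs[symbol])
    return total

# A trivial predictor: uniform over two symbols. Any binary string then
# costs exactly 1 bit per symbol under this model.
uniform = lambda history: {"0": 0.5, "1": 0.5}
print(code_length_bits("0110", uniform))  # 4.0
```

A better predictor (lower log loss) yields shorter codes, which is why training large sequence models by log loss can be read as training compressors.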
Transformers have impressive generalization capabilities on tasks with a fixed context length. However, they fail to generalize to sequences of arbitrary length, even for seemingly simple tasks such as duplicating a string. Moreover, simply training…
External link:
http://arxiv.org/abs/2305.16843
Author:
Genewein, Tim, Delétang, Grégoire, Ruoss, Anian, Wenliang, Li Kevin, Catt, Elliot, Dutordoir, Vincent, Grau-Moya, Jordi, Orseau, Laurent, Hutter, Marcus, Veness, Joel
Memory-based meta-learning is a technique for approximating Bayes-optimal predictors. Under fairly general conditions, minimizing sequential prediction error, measured by the log loss, leads to implicit meta-learning. The goal of this work is to…
External link:
http://arxiv.org/abs/2302.03067
Author:
Grau-Moya, Jordi, Delétang, Grégoire, Kunesch, Markus, Genewein, Tim, Catt, Elliot, Li, Kevin, Ruoss, Anian, Cundy, Chris, Veness, Joel, Wang, Jane, Hutter, Marcus, Summerfield, Christopher, Legg, Shane, Ortega, Pedro
Meta-training agents with memory has been shown to culminate in Bayes-optimal agents, which casts Bayes-optimality as the implicit solution to a numerical optimization problem rather than an explicit modeling assumption. Bayes-optimal agents are risk…
External link:
http://arxiv.org/abs/2209.15618
Author:
Delétang, Grégoire, Ruoss, Anian, Grau-Moya, Jordi, Genewein, Tim, Wenliang, Li Kevin, Catt, Elliot, Cundy, Chris, Hutter, Marcus, Legg, Shane, Veness, Joel, Ortega, Pedro A.
Reliable generalization lies at the heart of safe ML and AI. However, understanding when and how neural networks generalize remains one of the most important unsolved problems in the field. In this work, we conduct an extensive empirical study (20'91…
External link:
http://arxiv.org/abs/2207.02098
Author:
Ortega, Pedro A., Kunesch, Markus, Delétang, Grégoire, Genewein, Tim, Grau-Moya, Jordi, Veness, Joel, Buchli, Jonas, Degrave, Jonas, Piot, Bilal, Perolat, Julien, Everitt, Tom, Tallec, Corentin, Parisotto, Emilio, Erez, Tom, Chen, Yutian, Reed, Scott, Hutter, Marcus, de Freitas, Nando, Legg, Shane
The recent phenomenal success of language models has reinvigorated machine learning research, and large sequence models such as transformers are being applied to a variety of domains. One important problem class that has remained relatively elusive…
External link:
http://arxiv.org/abs/2110.10819
Reinforcement Learning formalises an embodied agent's interaction with the environment through observations, rewards and actions. But where do the actions come from? Actions are often considered to represent something external, such as the movement…
External link:
http://arxiv.org/abs/2109.15147
Human intelligence is characterized not only by the capacity to learn complex skills, but the ability to rapidly adapt and acquire new skills within an ever-changing environment. In this work we study how the learning of modular solutions can allow…
External link:
http://arxiv.org/abs/2010.12268