Výsledky vyhledávání

Report

N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs

Autor: Zisman, Ilya, Nikulin, Alexander, Polubarov, Andrei, Lyubaykin, Nikita, Kurenkov, Vladislav

In-context learning allows models like transformers to adapt to new tasks from a few examples without updating their weights, a desirable trait for reinforcement learning (RL). However, existing in-context RL methods, such as Algorithm Distillation (

Externí odkaz: http://arxiv.org/abs/2411.01958

Zobrazit plný text záznamu

Report

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Autor: Nikulin, Alexander, Zisman, Ilya, Zemtsov, Alexey, Sinii, Viacheslav, Kurenkov, Vladislav, Kolesnikov, Sergey

Following the success of the in-context learning paradigm in large-scale language and computer vision models, the recently emerging field of in-context reinforcement learning is experiencing a rapid growth. However, its development has been held back

Externí odkaz: http://arxiv.org/abs/2406.08973

Zobrazit plný text záznamu

Report

TORAX: A Fast and Differentiable Tokamak Transport Simulator in JAX

Autor: Citrin, Jonathan, Goodfellow, Ian, Raju, Akhil, Chen, Jeremy, Degrave, Jonas, Donner, Craig, Felici, Federico, Hamel, Philippe, Huber, Andrea, Nikulin, Dmitry, Pfau, David, Tracey, Brendan, Riedmiller, Martin, Kohli, Pushmeet

We present TORAX, a new, open-source, differentiable tokamak core transport simulator implemented in Python using the JAX framework. TORAX solves the coupled equations for ion heat transport, electron heat transport, particle transport, and current d

Externí odkaz: http://arxiv.org/abs/2406.06718

Zobrazit plný text záznamu

Report

EEG-Features for Generalized Deepfake Detection

Autor: Beckmann, Arian, Stephani, Tilman, Klotzsche, Felix, Chen, Yonghao, Hofmann, Simon M., Villringer, Arno, Gaebler, Michael, Nikulin, Vadim, Bosse, Sebastian, Eisert, Peter, Hilsmann, Anna

Since the advent of Deepfakes in digital media, the development of robust and reliable detection mechanism is urgently called for. In this study, we explore a novel approach to Deepfake detection by utilizing electroencephalography (EEG) measured fro

Externí odkaz: http://arxiv.org/abs/2405.08527

Zobrazit plný text záznamu

Report

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

In many Reinforcement Learning (RL) papers, learning curves are useful indicators to measure the effectiveness of RL algorithms. However, the complete raw data of the learning curves are rarely available. As a result, it is usually necessary to repro

Externí odkaz: http://arxiv.org/abs/2402.03046

Zobrazit plný text záznamu

Report

In-Context Reinforcement Learning for Variable Action Spaces

Autor: Sinii, Viacheslav, Nikulin, Alexander, Kurenkov, Vladislav, Zisman, Ilya, Kolesnikov, Sergey

Recently, it has been shown that transformers pre-trained on diverse datasets with multi-episode contexts can generalize to new reinforcement learning tasks in-context. A key limitation of previously proposed models is their reliance on a predefined

Externí odkaz: http://arxiv.org/abs/2312.13327

Zobrazit plný text záznamu

Report

Emergence of In-Context Reinforcement Learning from Noise Distillation

Autor: Zisman, Ilya, Kurenkov, Vladislav, Nikulin, Alexander, Sinii, Viacheslav, Kolesnikov, Sergey

Recently, extensive studies in Reinforcement Learning have been carried out on the ability of transformers to adapt in-context to various environments and tasks. Current in-context RL methods are limited by their strict requirements for data, which n

Externí odkaz: http://arxiv.org/abs/2312.12275

Zobrazit plný text záznamu

Report

XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

Autor: Nikulin, Alexander, Kurenkov, Vladislav, Zisman, Ilya, Agarkov, Artem, Sinii, Viacheslav, Kolesnikov, Sergey

Inspired by the diversity and depth of XLand and the simplicity and minimalism of MiniGrid, we present XLand-MiniGrid, a suite of tools and grid-world environments for meta-reinforcement learning research. Written in JAX, XLand-MiniGrid is designed t

Externí odkaz: http://arxiv.org/abs/2312.12044

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání