Výsledky vyhledávání - "Bulatov, Aydar"

Report

Long Input Benchmark for Russian Analysis

Autor: Churin, Igor, Apishev, Murat, Tikhonova, Maria, Shevelev, Denis, Bulatov, Aydar, Kuratov, Yuri, Averkiev, Sergej, Fenogenova, Alena

Recent advancements in Natural Language Processing (NLP) have fostered the development of Large Language Models (LLMs) that can solve an immense variety of tasks. One of the key aspects of their application is their ability to work with long text doc

Externí odkaz: http://arxiv.org/abs/2408.02439

Zobrazit plný text záznamu

Report

Associative Recurrent Memory Transformer

Autor: Rodkin, Ivan, Kuratov, Yuri, Bulatov, Aydar, Burtsev, Mikhail

This paper addresses the challenge of creating a neural architecture for very long sequences that requires constant time for processing new information at each time step. Our approach, Associative Recurrent Memory Transformer (ARMT), is based on tran

Externí odkaz: http://arxiv.org/abs/2407.04841

Zobrazit plný text záznamu

Report

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Autor: Kuratov, Yuri, Bulatov, Aydar, Anokhin, Petr, Rodkin, Ivan, Sorokin, Dmitry, Sorokin, Artyom, Burtsev, Mikhail

In recent years, the input context sizes of large language models (LLMs) have increased dramatically. However, existing evaluation methods have not kept pace, failing to comprehensively assess the efficiency of models in handling long contexts. To br

Externí odkaz: http://arxiv.org/abs/2406.10149

Zobrazit plný text záznamu

Report

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss

Autor: Kuratov, Yuri, Bulatov, Aydar, Anokhin, Petr, Sorokin, Dmitry, Sorokin, Artyom, Burtsev, Mikhail

This paper addresses the challenge of processing long documents using generative transformer models. To evaluate different approaches, we introduce BABILong, a new benchmark designed to assess model capabilities in extracting and processing distribut

Externí odkaz: http://arxiv.org/abs/2402.10790

Zobrazit plný text záznamu

Report

Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information

Autor: Chepurova, Alla, Bulatov, Aydar, Kuratov, Yuri, Burtsev, Mikhail

Real-world Knowledge Graphs (KGs) often suffer from incompleteness, which limits their potential performance. Knowledge Graph Completion (KGC) techniques aim to address this issue. However, traditional KGC methods are computationally intensive and im

Externí odkaz: http://arxiv.org/abs/2311.01326

Zobrazit plný text záznamu

Report

Scaling Transformer to 1M tokens and beyond with RMT

Autor: Bulatov, Aydar, Kuratov, Yuri, Kapushev, Yermek, Burtsev, Mikhail S.

A major limitation for the broader scope of problems solvable by transformers is the quadratic scaling of computational complexity with input size. In this study, we investigate the recurrent memory augmentation of pre-trained transformer models to e

Externí odkaz: http://arxiv.org/abs/2304.11062

Zobrazit plný text záznamu

Report

Recurrent Memory Transformer

Autor: Bulatov, Aydar, Kuratov, Yuri, Burtsev, Mikhail S.

Transformer-based models show their effectiveness across multiple domains and tasks. The self-attention allows to combine information from all sequence elements into context-aware representations. However, global and local information has to be store

Externí odkaz: http://arxiv.org/abs/2207.06881

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání