Showing 1 - 8 of 8 for search: '"Dong, Zican"'
Author:
Tang, Tianyi, Hu, Yiwen, Li, Bingqian, Luo, Wenyang, Qin, Zijing, Sun, Haoxiang, Wang, Jiapeng, Xu, Shiyi, Cheng, Xiaoxue, Guo, Geyang, Peng, Han, Zheng, Bowen, Tang, Yiru, Min, Yingqian, Chen, Yushuo, Chen, Jie, Zhao, Yuanqian, Ding, Luran, Wang, Yuhao, Dong, Zican, Xia, Chunxuan, Li, Junyi, Zhou, Kun, Zhao, Wayne Xin, Wen, Ji-Rong
To facilitate the research on large language models (LLMs), this paper presents a comprehensive and unified library, LLMBox, to ease the development, use, and evaluation of LLMs. This library is featured with three main merits: (1) a unified data interface…
External link:
http://arxiv.org/abs/2407.05563
Author:
Zhu, Yutao, Zhou, Kun, Mao, Kelong, Chen, Wentong, Sun, Yiding, Chen, Zhipeng, Cao, Qian, Wu, Yihan, Chen, Yushuo, Wang, Feng, Zhang, Lei, Li, Junyi, Wang, Xiaolei, Wang, Lei, Zhang, Beichen, Dong, Zican, Cheng, Xiaoxue, Chen, Yuhan, Tang, Xinyu, Hou, Yupeng, Ren, Qiangqiang, Pang, Xincheng, Xie, Shufang, Zhao, Wayne Xin, Dou, Zhicheng, Mao, Jiaxin, Lin, Yankai, Song, Ruihua, Xu, Jun, Chen, Xu, Yan, Rui, Wei, Zhewei, Hu, Di, Huang, Wenbing, Gao, Ze-Feng, Chen, Yueguo, Lu, Weizheng, Wen, Ji-Rong
Large language models (LLMs) have become the foundation of many applications, leveraging their extensive capabilities in processing and understanding natural language. While many open-source LLMs have been released with technical reports, the lack of…
External link:
http://arxiv.org/abs/2406.19853
Author:
Dong, Zican, Li, Junyi, Men, Xin, Zhao, Wayne Xin, Wang, Bingbing, Tian, Zhen, Chen, Weipeng, Wen, Ji-Rong
Transformer-based large language models (LLMs) typically have a limited context window, resulting in significant performance degradation when processing text beyond the length of the context window. Extensive studies have been proposed to extend the…
External link:
http://arxiv.org/abs/2405.18009
Large language models (LLMs) have achieved dramatic proficiency over NLP tasks with normal length. Recently, multiple studies have committed to extending the context length and enhancing the long text modeling capabilities of LLMs. To comprehensively…
External link:
http://arxiv.org/abs/2309.13345
In this paper, we study how to improve the zero-shot reasoning ability of large language models (LLMs) over structured data in a unified way. Inspired by the study on tool augmentation for LLMs, we develop an Iterative Reading-then-Reasoning (IRR)…
External link:
http://arxiv.org/abs/2305.09645
Author:
Zhao, Wayne Xin, Zhou, Kun, Li, Junyi, Tang, Tianyi, Wang, Xiaolei, Hou, Yupeng, Min, Yingqian, Zhang, Beichen, Zhang, Junjie, Dong, Zican, Du, Yifan, Yang, Chen, Chen, Yushuo, Chen, Zhipeng, Jiang, Jinhao, Ren, Ruiyang, Li, Yifan, Tang, Xinyu, Liu, Zikang, Liu, Peiyu, Nie, Jian-Yun, Wen, Ji-Rong
Language is essentially a complex, intricate system of human expressions governed by grammatical rules. It poses a significant challenge to develop capable AI algorithms for comprehending and grasping a language. As a major approach, language modeling…
External link:
http://arxiv.org/abs/2303.18223
Modeling long texts has been an essential technique in the field of natural language processing (NLP). With the ever-growing number of long documents, it is important to develop effective modeling methods that can process and analyze such texts. However…
External link:
http://arxiv.org/abs/2302.14502
Author:
Tang, Tianyi, Li, Junyi, Chen, Zhipeng, Hu, Yiwen, Yu, Zhuohao, Dai, Wenxun, Dong, Zican, Cheng, Xiaoxue, Wang, Yuhao, Zhao, Wayne Xin, Nie, Jian-Yun, Wen, Ji-Rong
To facilitate research on text generation, this paper presents a comprehensive and unified library, TextBox 2.0, focusing on the use of pre-trained language models (PLMs). To be comprehensive, our library covers 13 common text generation tasks and…
External link:
http://arxiv.org/abs/2212.13005