Showing 1 - 10 of 352
for search: '"SCHWARTZ, ROY"'
We consider two classic problems: maximum coverage and monotone submodular maximization subject to a cardinality constraint. [Nemhauser--Wolsey--Fisher '78] proved that the greedy algorithm provides an approximation of $1-1/e$ for both problems, and …
External link:
http://arxiv.org/abs/2411.05553
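The greedy $(1-1/e)$ guarantee mentioned in this abstract can be illustrated with a minimal sketch (the toy sets and function name below are mine, not from the paper):

```python
def greedy_max_coverage(sets, k):
    # Greedy rule: repeatedly pick the set covering the most
    # not-yet-covered elements. Nemhauser--Wolsey--Fisher '78 show this
    # is a (1 - 1/e)-approximation for maximum coverage.
    covered, chosen = set(), []
    for _ in range(k):
        gains = [len(s - covered) for s in sets]
        best = max(range(len(sets)), key=gains.__getitem__)
        if gains[best] == 0:
            break  # nothing new can be covered
        chosen.append(best)
        covered |= sets[best]
    return chosen, covered

chosen, covered = greedy_max_coverage([{1, 2, 3}, {3, 4}, {4, 5, 6}], k=2)
# chosen == [0, 2], covered == {1, 2, 3, 4, 5, 6}
```

Note the greedy choice skips set `{3, 4}` entirely: its marginal gain is dominated at every step, which is exactly the behavior the approximation analysis exploits.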
Natural language is composed of words, but modern LLMs process sub-words as input. A natural question raised by this discrepancy is whether LLMs encode words internally, and if so how. We present evidence that LLMs engage in an intrinsic detokenization …
External link:
http://arxiv.org/abs/2410.05864
Author:
Ben-Artzy, Amit, Schwartz, Roy
In decoder-based LLMs, the representation of a given layer serves two purposes: as input to the next layer during the computation of the current token; and as input to the attention mechanism of future tokens. In this work, we show that the importance …
External link:
http://arxiv.org/abs/2409.03621
Author:
Kuznetsov, Ilia, Afzal, Osama Mohammed, Dercksen, Koen, Dycke, Nils, Goldberg, Alexander, Hope, Tom, Hovy, Dirk, Kummerfeld, Jonathan K., Lauscher, Anne, Leyton-Brown, Kevin, Lu, Sheng, Mausam, Mieskes, Margot, Névéol, Aurélie, Pruthi, Danish, Qu, Lizhen, Schwartz, Roy, Smith, Noah A., Solorio, Thamar, Wang, Jingyan, Zhu, Xiaodan, Rogers, Anna, Shah, Nihar B., Gurevych, Iryna
The number of scientific articles produced every year is growing rapidly. Providing quality control over them is crucial for scientists and, ultimately, for the public good. In modern science, this process is largely delegated to peer review -- a dis…
External link:
http://arxiv.org/abs/2405.06563
Author:
Mamou, Jonathan, Pereg, Oren, Korat, Daniel, Berchansky, Moshe, Timor, Nadav, Wasserblat, Moshe, Schwartz, Roy
Speculative decoding is commonly used for reducing the inference latency of large language models. Its effectiveness depends highly on the speculation lookahead (SL) -- the number of tokens generated by the draft model at each iteration. In this work we …
External link:
http://arxiv.org/abs/2405.04304
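The role of the speculation lookahead can be seen in a toy sketch (the deterministic "models" below are stand-ins I made up; real speculative decoding verifies sampled draft tokens against an LLM's distribution):

```python
def speculative_decode(target, draft, prompt, sl, max_new):
    # Each iteration: the draft model proposes `sl` tokens; the target
    # model accepts the longest agreeing prefix and supplies one
    # corrected token at the first disagreement, so at least one token
    # is produced per iteration.
    out = list(prompt)
    while len(out) - len(prompt) < max_new:
        proposal = draft(out, sl)
        accepted = []
        for tok in proposal:
            if target(out + accepted) == tok:
                accepted.append(tok)                     # target agrees: keep it
            else:
                accepted.append(target(out + accepted))  # correct and stop
                break
        out.extend(accepted)
    return out[:len(prompt) + max_new]

# Stand-in "models": the target continues an integer sequence by +1;
# the draft is right except it stumbles after even tokens.
target = lambda seq: seq[-1] + 1
def draft(seq, sl):
    cur = list(seq)
    for _ in range(sl):
        cur.append(cur[-1] + (2 if cur[-1] % 2 == 0 else 1))
    return cur[len(seq):]

speculative_decode(target, draft, [0], sl=4, max_new=6)
# -> [0, 1, 2, 3, 4, 5, 6]
```

With this error pattern the draft is rejected after at most two tokens per iteration, so raising `sl` beyond that wastes draft work -- the sensitivity to SL that the abstract refers to.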
Author:
Reif, Yuval, Schwartz, Roy
Large language models (LLMs) have shown remarkable adaptability to diverse tasks, by leveraging context prompts containing instructions, or minimal input-output examples. However, recent work revealed they also exhibit label bias -- an undesirable pr…
External link:
http://arxiv.org/abs/2405.02743
It is a common belief that large language models (LLMs) are better than smaller-sized ones. However, larger models also require significantly more time and compute during inference. This begs the question: what happens when both models operate under …
External link:
http://arxiv.org/abs/2404.00725
Transformers are considered conceptually different from the previous generation of state-of-the-art NLP models -- recurrent neural networks (RNNs). In this work, we demonstrate that decoder-only transformers can in fact be conceptualized as unbounded …
External link:
http://arxiv.org/abs/2401.06104
We study graph ordering problems with a min-max objective. A classical problem of this type is cutwidth, where given a graph we want to order its vertices such that the number of edges crossing any point is minimized. We give a $\log^{1+o(1)}(n)$ approximation …
External link:
http://arxiv.org/abs/2311.15639
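The cutwidth objective described in this abstract can be computed exactly by brute force on tiny graphs (a sketch of the definition only; the paper gives an approximation algorithm, not this enumeration):

```python
from itertools import permutations

def cutwidth(n, edges):
    # For an ordering, edge (u, v) crosses the gap after position i
    # iff min(pos) <= i < max(pos). The width of an ordering is the
    # max crossing count over its gaps; cutwidth is the min width over
    # all n! orderings -- exponential, so tiny graphs only.
    best = float("inf")
    for order in permutations(range(n)):
        pos = {v: i for i, v in enumerate(order)}
        width = max(
            (sum(1 for u, v in edges
                 if min(pos[u], pos[v]) <= i < max(pos[u], pos[v]))
             for i in range(n - 1)),
            default=0,
        )
        best = min(best, width)
    return best

cutwidth(4, [(0, 1), (1, 2), (2, 3)])  # path: 1
cutwidth(3, [(0, 1), (1, 2), (0, 2)])  # triangle: 2
```

The path example shows why cutwidth is a min-max objective: the natural left-to-right ordering has exactly one edge over every gap, and no ordering can do better.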
In this paper we consider the online Submodular Welfare (SW) problem. In this problem we are given $n$ bidders each equipped with a general (not necessarily monotone) submodular utility and $m$ items that arrive online. The goal is to assign each item …
External link:
http://arxiv.org/abs/2308.07746