Showing 1 - 10 of 944 for search: '"Lewis, Patrick A."'
Author:
Verga, Pat, Hofstatter, Sebastian, Althammer, Sophia, Su, Yixuan, Piktus, Aleksandra, Arkhangorodsky, Arkady, Xu, Minjie, White, Naomi, Lewis, Patrick
As Large Language Models (LLMs) have become more advanced, they have outpaced our ability to accurately evaluate their quality. Not only is finding data to adequately probe particular model properties difficult, but evaluating the correctness of a…
External link:
http://arxiv.org/abs/2404.18796
Author:
Li, Yuhong, Huang, Yingbing, Yang, Bowen, Venkitesh, Bharat, Locatelli, Acyr, Ye, Hanchen, Cai, Tianle, Lewis, Patrick, Chen, Deming
Large Language Models (LLMs) have made remarkable progress in processing extensive contexts, with the Key-Value (KV) cache playing a vital role in enhancing their performance. However, the growth of the KV cache in response to increasing input length…
External link:
http://arxiv.org/abs/2404.14469
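The KV-cache growth the abstract above refers to can be illustrated with a minimal sketch: in autoregressive decoding, each generated token appends one key and one value row to the cache, so memory scales linearly with sequence length. This is a toy single-head attention loop, not the paper's method.

```python
import numpy as np

def attend(q, k_cache, v_cache):
    """Single-head dot-product attention over all cached keys/values."""
    scores = k_cache @ q / np.sqrt(len(q))
    w = np.exp(scores - scores.max())
    w /= w.sum()
    return w @ v_cache

dim, steps = 8, 16
rng = np.random.default_rng(0)
k_cache = np.empty((0, dim))
v_cache = np.empty((0, dim))
for _ in range(steps):
    q, k, v = rng.normal(size=(3, dim))
    # Each decoding step appends one key and one value row:
    k_cache = np.vstack([k_cache, k[None]])
    v_cache = np.vstack([v_cache, v[None]])
    out = attend(q, k_cache, v_cache)

# After 16 steps the cache holds 16 rows per tensor -- the linear
# memory growth that motivates KV-cache compression work.
```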
To date, toxicity mitigation in language models has almost entirely been focused on single-language settings. As language models embrace multilingual capabilities, it is crucial that our safety measures keep pace. Recognizing this research gap, our approach…
External link:
http://arxiv.org/abs/2403.03893
Dense retrievers compress source documents into (possibly lossy) vector representations, yet there is little analysis of what information is lost versus preserved, and how it affects downstream tasks. We conduct the first analysis of the information…
External link:
http://arxiv.org/abs/2402.15925
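The lossiness of fixed-size vector representations mentioned above can be made concrete with a toy sketch. Here a bag-of-words encoder stands in for a learned dense encoder (an assumption for illustration only): word order and out-of-vocabulary tokens are discarded outright, so distinct documents can collapse to similar vectors.

```python
import numpy as np

def embed(text, vocab):
    """Toy bag-of-words encoder standing in for a learned dense encoder.
    Word order and out-of-vocabulary tokens are discarded -- one concrete
    way a fixed-size vector representation loses information."""
    v = np.zeros(len(vocab))
    for tok in text.lower().split():
        if tok in vocab:
            v[vocab[tok]] += 1.0
    n = np.linalg.norm(v)
    return v / n if n else v

docs = [
    "dense retrievers compress documents into vectors",
    "the key value cache grows with input length",
    "toxicity mitigation in multilingual language models",
]
vocab = {tok: i for i, tok in
         enumerate(sorted({t for d in docs for t in d.lower().split()}))}
doc_vecs = np.stack([embed(d, vocab) for d in docs])

query = "compressing documents into vector representations"
scores = doc_vecs @ embed(query, vocab)
best = int(np.argmax(scores))  # highest-scoring document index
```

Only the first document shares tokens with the query, so it scores highest; the morphological variants ("compressing" vs. "compress") contribute nothing, illustrating information the representation fails to preserve.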
Listwise rerankers based on large language models (LLMs) are the zero-shot state of the art. However, current work in this direction all depends on GPT models, making them a single point of failure in scientific reproducibility. Moreover, it raises…
External link:
http://arxiv.org/abs/2312.02969
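A listwise reranker typically prompts an LLM with numbered candidate passages and asks it to emit a ranking such as "[2] > [1] > [3]", which the caller then parses back into an ordering. The format below is a common convention, not necessarily the one this paper uses; the fallback behavior for malformed output is likewise an assumption.

```python
import re

def parse_permutation(response, num_docs):
    """Parse a listwise ranking string such as '[2] > [1] > [3]' (1-based)
    into a 0-based ordering. Out-of-range or repeated indices are dropped,
    and candidates the model omitted are appended in original order --
    a common fallback, though the exact format is model-dependent."""
    seen = set()
    ranking = []
    for m in re.findall(r"\[(\d+)\]", response):
        i = int(m) - 1
        if 0 <= i < num_docs and i not in seen:
            seen.add(i)
            ranking.append(i)
    ranking.extend(i for i in range(num_docs) if i not in seen)
    return ranking
```

For example, `parse_permutation("[3] > [1]", 4)` returns `[2, 0, 1, 3]`: the two ranked candidates come first, and the unranked ones keep their original order.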
Considerable effort has been dedicated to mitigating toxicity, but existing methods often require drastic modifications to model parameters or the use of computationally intensive auxiliary models. Furthermore, previous approaches have often neglected…
External link:
http://arxiv.org/abs/2310.07589
Perception of toxicity evolves over time and often differs between geographies and cultural backgrounds. Similarly, black-box commercially available APIs for detecting toxicity, such as the Perspective API, are not static, but frequently retrained to…
External link:
http://arxiv.org/abs/2304.12397
Prior work shows that it is possible to expand pretrained Masked Language Models (MLMs) to new languages by learning a new set of embeddings, while keeping the transformer body frozen. Despite learning a small subset of parameters, this approach is n…
External link:
http://arxiv.org/abs/2212.10503
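The frozen-body setup described above can be sketched in miniature: the pretrained model's weights are held fixed and gradient descent updates only a new embedding table. Here a single fixed linear map stands in for the transformer body, and random targets stand in for the MLM objective; both are simplifying assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
dim, vocab_new = 8, 5

# Frozen "transformer body": here just a fixed linear map, for illustration.
W = rng.normal(size=(dim, dim))

# New-language embedding table -- the only trainable parameters.
E = rng.normal(size=(vocab_new, dim)) * 0.1

# Toy regression targets standing in for the training signal.
Y = rng.normal(size=(vocab_new, dim))

def loss(E):
    return float(np.mean((E @ W.T - Y) ** 2))

loss_before = loss(E)
lr = 0.01
for _ in range(200):
    # Gradient of the mean-squared error w.r.t. E; W is never updated.
    grad = 2 * (E @ W.T - Y) @ W / (vocab_new * dim)
    E -= lr * grad
loss_after = loss(E)
```

Only `E` changes across the loop, mirroring the parameter-efficient recipe: the frozen body does the heavy lifting while a small new table adapts the model to the new vocabulary.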