Výsledky vyhledávání

Report

Drowning in Documents: Consequences of Scaling Reranker Inference

Autor: Jacob, Mathew, Lindgren, Erik, Zaharia, Matei, Carbin, Michael, Khattab, Omar, Drozdov, Andrew

Rerankers, typically cross-encoders, are often used to re-score the documents retrieved by cheaper initial IR systems. This is because, though expensive, rerankers are assumed to be more effective. We challenge this assumption by measuring reranker p

Externí odkaz: http://arxiv.org/abs/2411.11767

Zobrazit plný text záznamu

Report

Problem-Oriented Segmentation and Retrieval: Case Study on Tutoring Conversations

Autor: Wang, Rose E., Wirawarn, Pawan, Lam, Kenny, Khattab, Omar, Demszky, Dorottya

Many open-ended conversations (e.g., tutoring lessons or business meetings) revolve around pre-defined reference materials, like worksheets or meeting bullets. To provide a framework for studying such conversation structure, we introduce Problem-Orie

Externí odkaz: http://arxiv.org/abs/2411.07598

Zobrazit plný text záznamu

Report

Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval

Autor: Hsu, Sheryl, Khattab, Omar, Finn, Chelsea, Sharma, Archit

The hallucinations of large language models (LLMs) are increasingly mitigated by allowing LLMs to search for information and to ground their answers in real sources. Unfortunately, LLMs often struggle with posing the right search queries, especially

Externí odkaz: http://arxiv.org/abs/2410.23214

Zobrazit plný text záznamu

Report

PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles

Autor: Siyan, Li, Raghuram, Vethavikashini Chithrra, Khattab, Omar, Hirschberg, Julia, Yu, Zhou

Users can divulge sensitive information to proprietary LLM providers, raising significant privacy concerns. While open-source models, hosted locally on the user's machine, alleviate some concerns, models that users can host locally are often less cap

Externí odkaz: http://arxiv.org/abs/2410.17127

Zobrazit plný text záznamu

Report

Transformers Utilization in Chart Understanding: A Review of Recent Advances & Future Trends

Autor: Al-Shetairy, Mirna, Hindy, Hanan, Khattab, Dina, Aref, Mostafa M.

In recent years, interest in vision-language tasks has grown, especially those involving chart interactions. These tasks are inherently multimodal, requiring models to process chart images, accompanying text, underlying data tables, and often user qu

Externí odkaz: http://arxiv.org/abs/2410.13883

Zobrazit plný text záznamu

Report

Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together

Autor: Soylu, Dilara, Potts, Christopher, Khattab, Omar

Natural Language Processing (NLP) systems are increasingly taking the form of sophisticated modular pipelines, e.g., Retrieval Augmented Generation (RAG), where each module may involve a distinct Language Model (LM) and an associated prompt template.

Externí odkaz: http://arxiv.org/abs/2407.10930

Zobrazit plný text záznamu

Report

HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification

Autor: EL-Assiouti, Omar S., Hamed, Ghada, Khattab, Dina, Ebied, Hala M.

Vision Transformers (ViTs) have achieved significant advancement in computer vision tasks due to their powerful modeling capacity. However, their performance notably degrades when trained with insufficient data due to lack of inherent inductive biase

Externí odkaz: http://arxiv.org/abs/2407.07516

Zobrazit plný text záznamu

Report

Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels

Autor: Xian, Jasper, Samuel, Saron, Khoubsirat, Faraz, Pradeep, Ronak, Sultan, Md Arafat, Florian, Radu, Roukos, Salim, Sil, Avirup, Potts, Christopher, Khattab, Omar

We develop a method for training small-scale (under 100M parameter) neural information retrieval models with as few as 10 gold relevance labels. The method depends on generating synthetic queries for documents using a language model (LM), and the key

Externí odkaz: http://arxiv.org/abs/2406.11706

Zobrazit plný text záznamu

Report

Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs

Autor: Opsahl-Ong, Krista, Ryan, Michael J, Purtell, Josh, Broman, David, Potts, Christopher, Zaharia, Matei, Khattab, Omar

Language Model Programs, i.e. sophisticated pipelines of modular language model (LM) calls, are increasingly advancing NLP tasks, but they require crafting prompts that are jointly effective for all modules. We study prompt optimization for LM progra

Externí odkaz: http://arxiv.org/abs/2406.11695

Zobrazit plný text záznamu

Report

Backtracing: Retrieving the Cause of the Query

Autor: Wang, Rose E., Wirawarn, Pawan, Khattab, Omar, Goodman, Noah, Demszky, Dorottya

Many online content portals allow users to ask questions to supplement their understanding (e.g., of lectures). While information retrieval (IR) systems may provide answers for such user queries, they do not directly assist content creators -- such a

Externí odkaz: http://arxiv.org/abs/2403.03956

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání