Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization

Autor:	Salemi, Alireza, Zamani, Hamed
Rok vydání:	2024
Předmět:	Computer Science - Computation and Language Computer Science - Information Retrieval
Druh dokumentu:	Working Paper
Popis:	This paper investigates the design of a unified search engine to serve multiple retrieval-augmented generation (RAG) agents, each with a distinct task, backbone large language model (LLM), and retrieval-augmentation strategy. We introduce an iterative approach where the search engine generates retrieval results for these RAG agents and gathers feedback on the quality of the retrieved documents during an offline phase. This feedback is then used to iteratively optimize the search engine using a novel expectation-maximization algorithm, with the goal of maximizing each agent's utility function. Additionally, we adapt this approach to an online setting, allowing the search engine to refine its behavior based on real-time individual agents feedback to better serve the results for each of them. Experiments on diverse datasets from the Knowledge-Intensive Language Tasks (KILT) benchmark demonstrates that our approach significantly on average outperforms competitive baselines across 18 RAG models. We also demonstrate that our method effectively ``personalizes'' the retrieval process for each RAG agent based on the collected feedback. Finally, we provide a comprehensive ablation study to explore various aspects of our method.
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/2410.09942 Zobrazit plný text záznamu View this record from Arxiv