Výsledky vyhledávání - "Zhuang, Shengyao"

Report

Embark on DenseQuest: A System for Selecting the Best Dense Retriever for a Custom Collection

Autor: Khramtsova, Ekaterina, Leelanupab, Teerapong, Zhuang, Shengyao, Baktashmotlagh, Mahsa, Zuccon, Guido

In this demo we present a web-based application for selecting an effective pre-trained dense retriever to use on a private collection. Our system, DenseQuest, provides unsupervised selection and ranking capabilities to predict the best dense retrieve

Externí odkaz: http://arxiv.org/abs/2407.06685

Zobrazit plný text záznamu

Report

Dense Retrieval with Continuous Explicit Feedback for Systematic Review Screening Prioritisation

Autor: Mao, Xinyu, Zhuang, Shengyao, Koopman, Bevan, Zuccon, Guido

The goal of screening prioritisation in systematic reviews is to identify relevant documents with high recall and rank them in early positions for review. This saves reviewing effort if paired with a stopping criterion, and speeds up review completio

Externí odkaz: http://arxiv.org/abs/2407.00635

Zobrazit plný text záznamu

Report

An Investigation of Prompt Variations for Zero-shot LLM-based Rankers

Autor: Sun, Shuoqi, Zhuang, Shengyao, Wang, Shuai, Zuccon, Guido

We provide a systematic understanding of the impact of specific components and wordings used in prompts on the effectiveness of rankers based on zero-shot Large Language Models (LLMs). Several zero-shot ranking methods based on LLMs have recently bee

Externí odkaz: http://arxiv.org/abs/2406.14117

Zobrazit plný text záznamu

Report

The Impact of Auxiliary Patient Data on Automated Chest X-Ray Report Generation and How to Incorporate It

Autor: Nicolson, Aaron, Zhuang, Shengyao, Dowling, Jason, Koopman, Bevan

This study investigates the integration of diverse patient data sources into multimodal language models for automated chest X-ray (CXR) report generation. Traditionally, CXR report generation relies solely on CXR images and limited radiology data, ov

Externí odkaz: http://arxiv.org/abs/2406.13181

Zobrazit plný text záznamu

Report

A Systematic Investigation of Distilling Large Language Models into Cross-Encoders for Passage Re-ranking

Autor: Schlatt, Ferdinand, Fröbe, Maik, Scells, Harrisen, Zhuang, Shengyao, Koopman, Bevan, Zuccon, Guido, Stein, Benno, Potthast, Martin, Hagen, Matthias

Cross-encoders distilled from large language models (LLMs) are often more effective re-rankers than cross-encoders fine-tuned on manually labeled data. However, the distilled models usually do not reach their teacher LLM's effectiveness. To investiga

Externí odkaz: http://arxiv.org/abs/2405.07920

Zobrazit plný text záznamu

Report

PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval

Autor: Zhuang, Shengyao, Ma, Xueguang, Koopman, Bevan, Lin, Jimmy, Zuccon, Guido

Utilizing large language models (LLMs) for zero-shot document ranking is done in one of two ways: 1) prompt-based re-ranking methods, which require no further training but are only feasible for re-ranking a handful of candidate documents due to compu

Externí odkaz: http://arxiv.org/abs/2404.18424

Zobrazit plný text záznamu

Report

Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders

Autor: Schlatt, Ferdinand, Fröbe, Maik, Scells, Harrisen, Zhuang, Shengyao, Koopman, Bevan, Zuccon, Guido, Stein, Benno, Potthast, Martin, Hagen, Matthias

Existing cross-encoder re-rankers can be categorized as pointwise, pairwise, or listwise models. Pair- and listwise models allow passage interactions, which usually makes them more effective than pointwise models but also less efficient and less robu

Externí odkaz: http://arxiv.org/abs/2404.06912

Zobrazit plný text záznamu

Report

Understanding and Mitigating the Threat of Vec2Text to Dense Retrieval Systems

Autor: Zhuang, Shengyao, Koopman, Bevan, Chu, Xiaoran, Zuccon, Guido

The emergence of Vec2Text -- a method for text embedding inversion -- has raised serious privacy concerns for dense retrieval systems which use text embeddings, such as those offered by OpenAI and Cohere. This threat comes from the ability for a mali

Externí odkaz: http://arxiv.org/abs/2402.12784

Zobrazit plný text záznamu

Report

FeB4RAG: Evaluating Federated Search in the Context of Retrieval Augmented Generation

Autor: Wang, Shuai, Khramtsova, Ekaterina, Zhuang, Shengyao, Zuccon, Guido

Federated search systems aggregate results from multiple search engines, selecting appropriate sources to enhance result quality and align with user intent. With the increasing uptake of Retrieval-Augmented Generation (RAG) pipelines, federated searc

Externí odkaz: http://arxiv.org/abs/2402.11891

Zobrazit plný text záznamu

Report

Large Language Models for Stemming: Promises, Pitfalls and Failures

Autor: Wang, Shuai, Zhuang, Shengyao, Zuccon, Guido

Text stemming is a natural language processing technique that is used to reduce words to their base form, also known as the root form. The use of stemming in IR has been shown to often improve the effectiveness of keyword-matching models such as BM25

Externí odkaz: http://arxiv.org/abs/2402.11757

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání