On Improving Pseudo-Relevance Feedback Using Pseudo-Irrelevant Documents.

Autor: Raman, Karthik, Udupa, Raghavendra, Bhattacharya, Pushpak, Bhole, Abhijit
Zdroj: Advances in Information Retrieval: 32nd European Conference on Ir Research, Ecir 2010, Milton Keynes, Uk, March 28-31, 2010.proceedings; 2010, p573-576, 4p
Abstrakt: Pseudo-Relevance Feedback (PRF) assumes that the top-ranking n documents of the initial retrieval are relevant and extracts expansion terms from them. In this work, we introduce the notion of pseudo-irrelevant documents, i.e. high-scoring documents outside of top n that are highly unlikely to be relevant. We show how pseudo-irrelevant documents can be used to extract better expansion terms from the top-ranking n documents: good expansion terms are those which discriminate the top-ranking n documents from the pseudo-irrelevant documents. Our approach gives substantial improvements in retrieval performance over Model-based Feedback on several test collections. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index