Information Retrieval: Concepts, Models, and Systems
Authors: | Venkat N. Gudivada, Dhana Rao, Amogh R. Gudivada |
---|---|
Year of publication: | 2018 |
Subject: | Information retrieval; Phrase; Computer science; 05 social sciences; Probabilistic logic; Relevance feedback; 02 engineering and technology; Weighting; Index (publishing); 020204 information systems; 0202 electrical engineering, electronic engineering, information engineering; Preprocessor; Language model; Reference architecture; 0509 other social sciences; 050904 information & library sciences |
DOI: | 10.1016/bs.host.2018.07.009 |
Description: | This chapter presents a tutorial introduction to modern information retrieval concepts, models, and systems. It begins with a reference architecture for current information retrieval (IR) systems, which provides a backdrop for the rest of the chapter. Text preprocessing is discussed using a mini Gutenberg corpus. Next, a categorization of IR models is presented, followed by a description of the Boolean IR model. The positional index is introduced, and the execution of phrase and proximity queries is discussed. Various term weighting schemes are discussed next, followed by descriptions of three IR models: Vector Space, Probabilistic, and Language models. Approaches to evaluating IR systems are presented. Relevance feedback techniques are described as a means of improving retrieval effectiveness. Various IR libraries, frameworks, and test collections are indicated. The chapter concludes by outlining facets of IR research and indicating additional reading. |
Database: | OpenAIRE |