Duplicate detection algorithms of bibliographic descriptions
Autor: | Sarantos Kapidakis, Anestis Sitas |
---|---|
Rok vydání: | 2008 |
Předmět: |
Information retrieval
Computer science Process (engineering) Information treatment for information services Information functions and techniques Cataloguing bibliographic control Cataloging Bibliographic systems Single step Διαχείριση υπηρεσιών λειτουργιών και τεχνικών πληροφόρησης Καταλογογράφηση βιβλιογραφικός έλεγχος Document management system Library and Information Sciences computer.software_genre Duplicate detection Information system Records management Library classification State (computer science) Data mining Cataloguing computer Algorithm Algorithms Information Systems |
Zdroj: | Library and Information Science Abstracts (LISA) Library Hi Tech 26.2 (2008): 287-301. |
ISSN: | 0737-8831 |
DOI: | 10.1108/07378830810880379 |
Popis: | Περιέχει το πλήρες κείμενο Purpose - The purpose of this paper is to focus on duplicate record detection algorithms used for detection in bibliographic databases. Design-methodology-approach - Individual algorithms, their application process for duplicate detection and their results are described based on available literature (published articles), information found at various library web sites and follow-up e-mail communications. Findings - Algorithms are categorized according to their application as a process of a single step or two consecutive steps. The results of deletion, merging, and temporary and virtual consolidation of duplicate records are studied. Originality-value - The paper presents an overview of the duplication detection algorithms and an up-to-date state of their application in different library systems. |
Databáze: | OpenAIRE |
Externí odkaz: |