Searching by approximate personal-name matching

Authors: Camps Pare, Rafael; Daude Ventura, Jordi
Contributors: Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
Subject:
Source: Recercat. Dipósit de la Recerca de Catalunya
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Description: We discuss the design, construction, and evaluation of a method for retrieving a person's information using the person's name as the search key, even when the name is deformed. We present a similarity function, the DEA function, based on the probabilities of the edit operations according to the letters involved and their position, and using a variable threshold. The efficacy of DEA, evaluated quantitatively without human relevance judgments, is far superior to that of known methods. We also present a very efficient approximate search technique for the DEA function, based on a compacted trie structure.
Database: OpenAIRE
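
The abstract above names three ingredients: edit costs that depend on the letters and their position, a variable threshold, and a trie that supports approximate search. The sketch below illustrates that general combination only. It is not the DEA function, whose probabilities and threshold rule are not given in this record. The names `edit_cost`, `INDEL`, `threshold`, `build_trie`, and `search`, and every numeric value, are assumptions chosen for illustration, and the trie is a plain one rather than a compacted one.

```python
from typing import Dict, List, Tuple

INDEL = 1.0  # assumed flat cost for inserting or deleting a letter


def edit_cost(a: str, b: str, pos: int) -> float:
    """Hypothetical substitution cost: an error on the first letter costs more."""
    if a == b:
        return 0.0
    return 1.5 if pos == 0 else 1.0


def threshold(query: str) -> float:
    """Assumed variable threshold: longer names tolerate more deformation."""
    return max(1.0, 0.25 * len(query))


class TrieNode:
    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.is_name = False


def build_trie(names: List[str]) -> TrieNode:
    """Store every name as a path of letters from the root."""
    root = TrieNode()
    for name in names:
        node = root
        for ch in name:
            node = node.children.setdefault(ch, TrieNode())
        node.is_name = True
    return root


def search(root: TrieNode, query: str) -> List[Tuple[str, float]]:
    """Return (name, distance) pairs within the threshold, closest first."""
    limit = threshold(query)
    # Row for the empty prefix: cost of inserting the first j query letters.
    first_row = [j * INDEL for j in range(len(query) + 1)]
    hits: List[Tuple[str, float]] = []

    def walk(node: TrieNode, prefix: str, prev_row: List[float]) -> None:
        if node.is_name and prev_row[-1] <= limit:
            hits.append((prefix, prev_row[-1]))
        for ch, child in node.children.items():
            # One dynamic-programming row per trie edge, shared by all
            # names that continue through this prefix.
            row = [prev_row[0] + INDEL]
            for j in range(1, len(query) + 1):
                row.append(min(
                    row[j - 1] + INDEL,
                    prev_row[j] + INDEL,
                    prev_row[j - 1] + edit_cost(query[j - 1], ch, j - 1),
                ))
            # Costs are non-negative, so no deeper row can drop below
            # min(row); the whole subtree can be skipped safely.
            if min(row) <= limit:
                walk(child, prefix + ch, row)

    walk(root, "", first_row)
    return sorted(hits, key=lambda hit: hit[1])
```

With the names `jordi`, `jorge`, `rafael`, `rafel`, and `raquel` in the trie, the query `jordy` matches `jordi` at distance 1.0, while `gordi` matches nothing: under these assumed costs a first-letter substitution costs 1.5, which exceeds the threshold of 1.25 for a five-letter query.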