Access to a Large Dictionary of Spanish Synonyms: A Tool for Fuzzy Information Retrieval

Autor: Jorge Graña-Gil, Santiago Fernández-Lanza, Alejandro Sobrino-Cerdeiriña
Rok vydání: 2006
Předmět:
Zdroj: Soft Computing in Web Information Retrieval ISBN: 3540315888
Soft Computing in Web Information Retrieva
DOI: 10.1007/3-540-31590-x_15
Popis: We start by analyzing the role of imprecision in information retrieval in the Web, some theoretical contributions for managing this problem and its presence in search engines, with special emphasis on the use of thesaurus in order to increase the number and relevance of the documents retrieved. We then present a general architecture for implementing large dictionaries in natural language processing applications which is able to store a considerable amount of data relating to the words contained in these dictionaries. In this modelling, efficient access to this information is guaranteed by the use of minimal deterministic acyclic finite-state automata. In addition, we implement a Spanish dictionary of synonyms and illustrate how our general model helps to transform the original dictionary into a computational framework capable of representing semantic relations between words. This process allows us to define synonymy as a gradual relation, which makes the final tool more suitable for word sense disambiguation tasks or for information retrieval applications than other traditional approaches. Moreover, our electronic dictionary, called Fdsa, will be freely available very soon for stand-alone use.
Databáze: OpenAIRE