Access to a Large Dictionary of Spanish Synonyms: A Tool for Fuzzy Information Retrieval
Autor: | Jorge Graña-Gil, Santiago Fernández-Lanza, Alejandro Sobrino-Cerdeiriña |
---|---|
Rok vydání: | 2006 |
Předmět: |
Thesaurus (information retrieval)
Information retrieval Relation (database) Computer science Synonym business.industry Process (engineering) Information retrieval applications computer.software_genre Search engine Query expansion Machine-readable dictionary Electronic dictionary Relevance (information retrieval) Artificial intelligence business computer Natural language processing |
Zdroj: | Soft Computing in Web Information Retrieval ISBN: 3540315888 Soft Computing in Web Information Retrieva |
DOI: | 10.1007/3-540-31590-x_15 |
Popis: | We start by analyzing the role of imprecision in information retrieval in the Web, some theoretical contributions for managing this problem and its presence in search engines, with special emphasis on the use of thesaurus in order to increase the number and relevance of the documents retrieved. We then present a general architecture for implementing large dictionaries in natural language processing applications which is able to store a considerable amount of data relating to the words contained in these dictionaries. In this modelling, efficient access to this information is guaranteed by the use of minimal deterministic acyclic finite-state automata. In addition, we implement a Spanish dictionary of synonyms and illustrate how our general model helps to transform the original dictionary into a computational framework capable of representing semantic relations between words. This process allows us to define synonymy as a gradual relation, which makes the final tool more suitable for word sense disambiguation tasks or for information retrieval applications than other traditional approaches. Moreover, our electronic dictionary, called Fdsa, will be freely available very soon for stand-alone use. |
Databáze: | OpenAIRE |
Externí odkaz: |