Corpus issus du Web : constitution et analyse informationnelle

Authors:	Fabrice Issac, Christophe Fouqueré
Contributors:	Fouqueré, Christophe
Year of publication:	2006
Subject:	Artificial intelligence and robotics; human-machine communication; [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL]; General Engineering; language engineering
Source:	Revue québécoise de linguistique. 32:111-134
ISSN:	1705-4591 0710-0167
DOI:	10.7202/012246ar
Description:	Compared with other information sources (technical documents, newspaper articles, etc.), the Web offers almost unlimited access to information of all kinds. This advantage can become counterproductive when relevant information is buried in a mass of unrelated texts. Our research attempts to evaluate how far natural language processing techniques can aid the search for information when the textual database is unorganized. Specifically, our study aims to specify mechanisms for reformulating queries. We outline the method followed to construct our corpus, and then analyse the informational relevance of the web pages retrieved when the initial query is varied.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f5198322319da838f5943d58667a11f3 https://doi.org/10.7202/012246ar