Traitor: associating concepts using the world wide web

Autor: Drijfhout, Wanno, Oliver, J., Oliver, Jundt, Wevers, L., Hiemstra, Djoerd
Přispěvatelé: Databases (Former)
Rok vydání: 2013
Předmět:
Zdroj: Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval, DIR 2013, 56-57
STARTPAGE=56;ENDPAGE=57;TITLE=Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval, DIR 2013
Popis: We use Common Crawl's 25TB data set of web pages to construct a database of associated concepts using Hadoop. The database can be queried through a web application with two query interfaces. A textual interface allows searching for similarities and differences between multiple concepts using a query language similar to set notation, and a graphical interface allows users to visualize similarity relationships of concepts in a force directed graph.
Databáze: OpenAIRE