Clustering with Random Indexing K-tree and XML Structure.

Authors: De Vries, Christopher M., Geva, Shlomo, De Vine, Lance
Source: Focused Retrieval & Evaluation; 2010, p407-415, 9p
Abstract: This paper describes the approach taken to the clustering task at INEX 2009 by a group at the Queensland University of Technology. The Random Indexing (RI) K-tree has been used with a representation that is based on the semantic markup available in the INEX 2009 Wikipedia collection. The RI K-tree is a scalable approach to clustering large document collections. This approach has produced quality clustering when evaluated using two different methodologies. [ABSTRACT FROM AUTHOR]
Database: Complementary Index