Autor: |
Chung, Yeh-Ching, Moreira, José E., Wang, Tong, Liu, Da-Xin, Lin, Xuan-Zuo, Sun, Wei, Ahmad, Gufran |
Zdroj: |
Advances in Grid & Pervasive Computing; 2006, p447-455, 9p |
Abstrakt: |
Clustering is able to facilitate Information Retrieval. This paper addresses the issue of clustering a large number of XML documents. We propose ICX algorithm with a novel similarity metric based on quantitative path. In our approach, each document is firstly represented by path sequences extracted from XML trees. Then these sequences are mapped into quantitative path, by which the distance between documents can be computed with low complexity. Finally, the desired clusters are constructed by utilizing ICX method with literal local search. Experimental results, based on XML documents obtained from DBLP, show the effectiveness and good performance of the proposed techniques. [ABSTRACT FROM AUTHOR] |
Databáze: |
Supplemental Index |
Externí odkaz: |
|