Hierarchical, Parameter-Free Community Discovery.

Autor: Papadimitriou, Spiros, Sun, Jimeng, Faloutsos, Christos, Yu, Philip S.
Zdroj: Machine Learning & Knowledge Discovery in Databases (9783540874805); 2008, p170-187, 18p
Abstrakt: Given a large bipartite graph (like document-term, or userproduct graph), how can we find meaningful communities, quickly, and automatically? We propose to look for community hierarchies, with communities- within-communities. Our proposed method, the Context-specific Cluster Tree (CCT) finds such communities at multiple levels, with no user intervention, based on information theoretic principles (MDL). More specifically, it partitions the graph into progressively more refined subgraphs, allowing users to quickly navigate from the global, coarse structure of a graph to more focused and local patterns. As a fringe benefit, and also as an additional indication of its quality, it also achieves better compression than typical, non-hierarchical methods. We demonstrate its scalability and effectiveness on real, large graphs. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index