FIMSIM: Discovering Communities by Frequent Item-Set Mining and Similarity Search
Autor: | Michal Batko, Pavel Zezula, Jan Sedmidubský, Jakub Valčík, Jakub Peschel |
---|---|
Rok vydání: | 2021 |
Předmět: |
Computer science
Nearest neighbor search Function (mathematics) computer.software_genre 01 natural sciences 010305 fluids & plasmas Similarity (network science) 0103 physical sciences Graph (abstract data type) Data mining 010306 general physics Control (linguistics) Cluster analysis computer Selection (genetic algorithm) Network analysis |
Zdroj: | Similarity Search and Applications ISBN: 9783030896560 SISAP |
DOI: | 10.1007/978-3-030-89657-7_28 |
Popis: | With the growth of structured graph data, the analysis of networks is an important topic. Community mining is one of the main analytical tasks of network analysis. Communities are dense clusters of nodes, possibly containing additional information about a network. In this paper, we present a community-detection approach, called FIMSIM, which is based on principles of frequent item-set mining and similarity search. The frequent item-set mining is used to extract cores of the communities, and a proposed similarity function is applied to discover suitable surroundings of the cores. The proposed approach outperforms the state-of-the-art DB-Link Clustering algorithm while enabling the easier selection of parameters. In addition, possible modifications are proposed to control the resulting communities better. |
Databáze: | OpenAIRE |
Externí odkaz: |