Effective and Efficient Discovery of Top-k Meta Paths in Heterogeneous Information Networks
Autor: | Tsz Nam Chan, Loc Do, Reynold Cheng, Zhipeng Huang, Haoci Zhang, Zichen Zhu |
---|---|
Rok vydání: | 2022 |
Předmět: | |
Zdroj: | IEEE Transactions on Knowledge and Data Engineering. 34:4172-4185 |
ISSN: | 2326-3865 1041-4347 |
DOI: | 10.1109/tkde.2020.3037218 |
Popis: | Heterogeneous information networks (HINs), which are typed graphs with labeled nodes and edges, have attracted tremendous interest from academia and industry. Given two HIN nodes ${s}$ and ${t}$ , and a natural number ${k}$ , we study the discovery of the ${k}$ most important meta paths in real time, which can be used to support friend search, product recommendation, anomaly detection, and graph clustering. In this work, we argue that the shortest path between ${s}$ and ${t}$ may not necessarily be the most important path. As such, we combine several ranking functions, which are based on frequency and rarity, to redefine the unified importance function of the meta paths between ${s}$ and ${t}$ . Although this importance function can capture more information, it is very time-consuming to find top- ${k}$ meta paths using this importance function. Therefore, we integrate this importance function into a multi-step framework, which can efficiently filter some impossible meta paths between ${s}$ and ${t}$ . In addition, we combine bidirectional searching algorithm with this framework to further boost the efficiency performance. The experiment on different datasets shows that our proposed method outperforms state-of-the-art algorithms in terms of effectiveness with reasonable response time. |
Databáze: | OpenAIRE |
Externí odkaz: |