Effective and Efficient Discovery of Top-k Meta Paths in Heterogeneous Information Networks

Autor: Tsz Nam Chan, Loc Do, Reynold Cheng, Zhipeng Huang, Haoci Zhang, Zichen Zhu
Rok vydání: 2022
Předmět:
Zdroj: IEEE Transactions on Knowledge and Data Engineering. 34:4172-4185
ISSN: 2326-3865
1041-4347
DOI: 10.1109/tkde.2020.3037218
Popis: Heterogeneous information networks (HINs), which are typed graphs with labeled nodes and edges, have attracted tremendous interest from academia and industry. Given two HIN nodes ${s}$ and ${t}$ , and a natural number ${k}$ , we study the discovery of the ${k}$ most important meta paths in real time, which can be used to support friend search, product recommendation, anomaly detection, and graph clustering. In this work, we argue that the shortest path between ${s}$ and ${t}$ may not necessarily be the most important path. As such, we combine several ranking functions, which are based on frequency and rarity, to redefine the unified importance function of the meta paths between ${s}$ and ${t}$ . Although this importance function can capture more information, it is very time-consuming to find top- ${k}$ meta paths using this importance function. Therefore, we integrate this importance function into a multi-step framework, which can efficiently filter some impossible meta paths between ${s}$ and ${t}$ . In addition, we combine bidirectional searching algorithm with this framework to further boost the efficiency performance. The experiment on different datasets shows that our proposed method outperforms state-of-the-art algorithms in terms of effectiveness with reasonable response time.
Databáze: OpenAIRE