Nonnegative matrix tri-factorization based clustering in a heterogeneous information network with star network schema

Autor: Xilong Che, Kuo Zhao, Juncheng Hu, Feng Wang, Yongheng Xing, Mo Han
Rok vydání: 2022
Předmět:
Zdroj: Tsinghua Science and Technology. 27:386-395
ISSN: 1007-0214
DOI: 10.26599/tst.2020.9010049
Popis: Heterogeneous Information Networks (HINs) contain multiple types of nodes and edges; therefore, they can preserve the semantic information and structure information. Cluster analysis using an HIN has obvious advantages over a transformation into a homogenous information network, which can promote the clustering results of different types of nodes. In our study, we applied a Nonnegative Matrix Tri-Factorization (NMTF) in a cluster analysis of multiple metapaths in HIN. Unlike the parameter estimation method of the probability distribution in previous studies, NMTF can obtain several dependent latent variables simultaneously, and each latent variable in NMTF is associated with the cluster of the corresponding node in the HIN. The method is suited to co-clustering leveraging multiple metapaths in HIN, because NMTF is employed for multiple nonnegative matrix factorizations simultaneously in our study. Experimental results on the real dataset show that the validity and correctness of our method, and the clustering result are better than that of the existing similar clustering algorithm.
Databáze: OpenAIRE