Clustering social audiences in business information networks
Autor: | Sai-Fu Fung, Ruiqi Hu, Yu Zheng, Shirui Pan, Guodong Long, Celina Ping Yu, Ting Guo |
---|---|
Rok vydání: | 2020 |
Předmět: |
Business information
Social network Computer science business.industry Process (engineering) Node (networking) 02 engineering and technology 01 natural sciences Data science Artificial Intelligence Business networking 0103 physical sciences Signal Processing Business decision mapping Business intelligence 0202 electrical engineering electronic engineering information engineering Key (cryptography) Artificial Intelligence & Image Processing 020201 artificial intelligence & image processing Computer Vision and Pattern Recognition 010306 general physics business Cluster analysis Software |
Zdroj: | Pattern Recognition. 100:107126 |
ISSN: | 0031-3203 |
DOI: | 10.1016/j.patcog.2019.107126 |
Popis: | © 2019 Elsevier Ltd Business information networks involve diverse users and rich content and have emerged as important platforms for enabling business intelligence and business decision making. A key step in an organizations business intelligence process is to cluster users with similar interests into social audiences and discover the roles they play within a business network. In this article, we propose a novel machine-learning approach, called CBIN, that co-clusters business information networks to discover and understand these audiences. The CBIN framework is based on co-factorization. The audience clusters are discovered from a combination of network structures and rich contextual information, such as node interactions and node-content correlations. Since what defines an audience cluster is data-driven, plus they often overlap, pre-determining the number of clusters is usually very difficult. Therefore, we have based CBIN on an overlapping clustering paradigm with a hold-out strategy to discover the optimal number of clusters given the underlying data. Experiments validate an outstanding performance by CBIN compared to other state-of-the-art algorithms on 13 real-world enterprise datasets. |
Databáze: | OpenAIRE |
Externí odkaz: |