Identifying accurate link predictors based on assortativity of complex networks

Autor: Ahmad F. Al Musawi, Satyaki Roy, Preetam Ghosh
Jazyk: angličtina
Rok vydání: 2022
Předmět:
Zdroj: Scientific Reports, Vol 12, Iss 1, Pp 1-16 (2022)
Druh dokumentu: article
ISSN: 2045-2322
DOI: 10.1038/s41598-022-22843-4
Popis: Abstract Link prediction algorithms in complex networks, such as social networks, biological networks, drug-drug interactions, communication networks, and so on, assign scores to predict potential links between two nodes. Link prediction (LP) enables researchers to learn unknown, new as well as future interactions among the entities being modeled in the complex networks. In addition to measures like degree distribution, clustering coefficient, centrality, etc., another metric to characterize structural properties is network assortativity which measures the tendency of nodes to connect with similar nodes. In this paper, we explore metrics that effectively predict the links based on the assortativity profiles of the complex networks. To this end, we first propose an approach that generates networks of varying assortativity levels and utilize three sets of link prediction models combining the similarity of neighborhoods and preferential attachment. We carry out experiments to study the LP accuracy (measured in terms of area under the precision-recall curve) of the link predictors individually and in combination with other baseline measures. Our analysis shows that link prediction models that explore a large neighborhood around nodes of interest, such as CH2-L2 and CH2-L3, perform consistently for assortative as well as disassortative networks. While common neighbor-based local measures are effective for assortative networks, our proposed combination of common neighbors with node degree is a good choice for the LP metric in disassortative networks. We discuss how this analysis helps achieve the best-parameterized combination of link prediction models and its significance in the context of link prediction from incomplete social and biological network data.
Databáze: Directory of Open Access Journals
Nepřihlášeným uživatelům se plný text nezobrazuje