What counts as a weak tie? A comparison of filtering techniques to analyze co-exposure networks
Authors: | Georg Stadler, Tian Yang, Subhayan Mukerjee, Sandra González-Bailón |
---|---|
Publication year: | 2022 |
Subject: |
Structure (mathematical logic); 0303 health sciences; Sociology and Political Science; Heuristic (computer science); Social connectedness; Computer science; General Social Sciences; 010402 general chemistry; 01 natural sciences; Measure (mathematics); Thresholding; 0104 chemical sciences; 03 medical and health sciences; Anthropology; Social media; Noise (video); Data mining; Raw data; General Psychology; 030304 developmental biology |
Source: | Social Networks. 68:386-393 |
ISSN: | 0378-8733 |
DOI: | 10.1016/j.socnet.2021.10.002 |
Description: | Co-exposure networks offer a useful tool for analyzing audience behavior. In these networks, nodes are sources of information and ties measure the strength of audience overlap. Past research has used this method to analyze exposure to content on social media and the web. However, we still lack a systematic assessment of how different choices in the construction of these networks impact the results. Here we evaluate three different filtering rules that have been used in the literature to eliminate noise in raw data and identify the strongest connections (i.e., those above a certain weight). Moreover, we also provide a mathematical heuristic to choose the optimal threshold. To illustrate our approach, we use two observed networks measuring co-exposure to news sources on the web. We then formulate the problem of filtering the networks as a trade-off between network sparsity (i.e., the need to remove the weakest ties) and connectedness (i.e., the need to preserve the observed connectivity). Our mathematical approach resolves this problem by finding the threshold that maximizes the number of edges removed while minimizing the number of nodes becoming isolates. This analytical technique is generalizable and can be applied to the analysis of any weighted structure that requires solving a similar trade-off between network measures. |
Database: | OpenAIRE |
External link: |
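
The trade-off described in the abstract (remove as many weak ties as possible while isolating as few nodes as possible) can be sketched roughly as follows. This is only an illustration: the scoring rule used here (share of edges removed minus share of nodes isolated), the function names `filter_stats` and `best_threshold`, and the `toy_edges` data are assumptions made for this example, not the authors' published heuristic or data.

```python
from typing import List, Tuple

# (source_a, source_b, audience-overlap weight); hypothetical toy data
Edge = Tuple[str, str, float]

toy_edges: List[Edge] = [
    ("A", "B", 0.9), ("B", "C", 0.8), ("C", "D", 0.5),
    ("A", "C", 0.1), ("A", "D", 0.05), ("B", "D", 0.02),
]


def filter_stats(edges: List[Edge], threshold: float) -> Tuple[float, float]:
    """Return (share of edges removed, share of nodes isolated) when only
    ties with weight >= threshold are kept."""
    nodes = {n for a, b, _ in edges for n in (a, b)}
    kept = [(a, b) for a, b, w in edges if w >= threshold]
    still_connected = {n for a, b in kept for n in (a, b)}
    edges_removed = 1 - len(kept) / len(edges)
    nodes_isolated = 1 - len(still_connected) / len(nodes)
    return edges_removed, nodes_isolated


def best_threshold(edges: List[Edge]) -> float:
    """Pick the observed weight that maximizes
    (share of edges removed) - (share of nodes isolated).
    Assumption: this simple difference stands in for the paper's heuristic."""
    candidates = sorted({w for _, _, w in edges})

    def score(t: float) -> float:
        removed, isolated = filter_stats(edges, t)
        return removed - isolated

    return max(candidates, key=score)


if __name__ == "__main__":
    t = best_threshold(toy_edges)
    print("chosen threshold:", t)
    print("(edges removed, nodes isolated):", filter_stats(toy_edges, t))
```

On the toy data, raising the threshold past the weight of the last tie attached to node D starts isolating nodes, so the score stops improving at that point. A real analysis would replace the toy edge list with the observed co-exposure weights.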