Simulating systematic bias in attributed social networks and its effect on rankings of minority nodes
Authors: | Florian Lemmerich, Markus Strohmaier, Leonie Neuhäuser, Michael T. Schaub, Felix I. Stamm |
---|---|
Year of publication: | 2020 |
Subject: |
Social and Information Networks (cs.SI)
FOS: Computer and information sciences; T57-57.97; Multidisciplinary; Observational error; Applied mathematics. Quantitative methods; Computer Networks and Communications; Relational database; Computer science; Node (networking); Attributed networks; Contrast (statistics); Computer Science - Social and Information Networks; Network topology; Homophily; Social networks; Computational Mathematics; Bias; Edge uncertainty; Enhanced Data Rates for GSM Evolution; Data mining; Rankings; Network analysis |
Source: | Applied Network Science, Vol 6, Iss 1, Pp 1-22 (2021) |
DOI: | 10.48550/arxiv.2010.11546 |
Description: | Network analysis provides powerful tools to learn about a variety of social systems. However, most analyses implicitly assume that the relational data under consideration is error-free and reliable, and that it accurately reflects the system to be analysed. Especially if the network consists of multiple groups (e.g., genders, races), this assumption conflicts with a range of systematic biases, measurement errors and other inaccuracies that are well documented in the literature. To investigate the effects of such errors, we introduce a framework for simulating systematic bias in attributed networks. Our framework enables us to model erroneous edge observations that are driven by external node attributes or errors arising from the (hidden) network structure itself. We exemplify how systematic inaccuracies distort conclusions drawn from network analyses on the task of minority representations in degree-based rankings. By analysing synthetic and real networks with varying homophily levels and group sizes, we find that the effect of introducing systematic edge errors depends on both the type of edge error and the level of homophily in the system: in heterophilic networks, minority representations in rankings are very sensitive to the type of systematic edge error. In contrast, in homophilic networks we find that minorities are at a disadvantage regardless of the type of error present. We thus conclude that the implications of systematic bias in edge data depend on an interplay between network topology and type of systematic error. This emphasises the need for an error model framework as developed here, which provides a first step towards studying the effects of systematic edge-uncertainty for various network analysis tasks. |
Database: | OpenAIRE |
External link: |
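
The kind of experiment the description outlines can be illustrated with a minimal Python sketch. It is not the authors' framework. All function names, parameters and the specific error model (removing observed edges with a probability that depends on the groups of the two endpoints) are assumptions made for illustration only. The sketch generates a two-group random network with a homophily parameter, applies one attribute-driven edge error, and measures the share of minority nodes in the top-k of a degree ranking.

```python
import random
from collections import Counter


def make_network(n=200, minority_frac=0.2, h=0.8, p=0.05, seed=0):
    """Hypothetical generator: random two-group graph.

    h in [0, 1] is an assumed homophily parameter: within-group pairs are
    linked with probability 2*p*h, between-group pairs with 2*p*(1-h).
    """
    rng = random.Random(seed)
    nodes = list(range(n))
    minority = set(rng.sample(nodes, int(n * minority_frac)))
    group = {v: ("min" if v in minority else "maj") for v in nodes}
    edges = set()
    for u in nodes:
        for v in range(u + 1, n):
            same = group[u] == group[v]
            if rng.random() < 2 * p * (h if same else 1 - h):
                edges.add((u, v))
    return group, edges


def drop_edges(edges, group, drop_prob, rng):
    """Assumed error model: each edge goes unobserved with a probability
    that depends on the (sorted) pair of endpoint groups."""
    kept = set()
    for u, v in edges:
        key = tuple(sorted((group[u], group[v])))
        if rng.random() >= drop_prob.get(key, 0.0):
            kept.add((u, v))
    return kept


def minority_share_top_k(edges, group, k):
    """Fraction of minority nodes among the k highest-degree nodes
    (ties broken by node id)."""
    deg = Counter()
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    ranked = sorted(group, key=lambda v: (-deg[v], v))
    return sum(group[v] == "min" for v in ranked[:k]) / k


if __name__ == "__main__":
    rng = random.Random(1)
    for h in (0.2, 0.8):  # heterophilic vs. homophilic setting
        group, edges = make_network(h=h)
        # Illustrative bias: half of the minority-minority edges are missed.
        noisy = drop_edges(edges, group, {("min", "min"): 0.5}, rng)
        print(h,
              minority_share_top_k(edges, group, 20),
              minority_share_top_k(noisy, group, 20))
```

Comparing the two printed shares for each value of `h` shows how a single error type can shift the ranking. Other error types, such as spurious added edges or errors that depend on network structure, would need their own functions.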