Showing 1 - 2 of 2 for search: '"Beloch, Rahel"'
Community models for malicious content detection, which take into account the context from a social graph alongside the content itself, have shown remarkable performance on benchmark datasets. Yet, misinformation and hate speech continue to propagate
External link:
http://arxiv.org/abs/2404.01822
Language models (LMs) exhibit and amplify many types of undesirable biases learned from the training data, including gender bias. However, we lack tools for effectively and efficiently changing this behavior without hurting general language modeling
External link:
http://arxiv.org/abs/2310.12611