Výsledky vyhledávání - "Beloch, Rahel"

Report

A (More) Realistic Evaluation Setup for Generalisation of Community Models on Malicious Content Detection

Autor: Verhoeven, Ivo, Mishra, Pushkar, Beloch, Rahel, Yannakoudakis, Helen, Shutova, Ekaterina

Community models for malicious content detection, which take into account the context from a social graph alongside the content itself, have shown remarkable performance on benchmark datasets. Yet, misinformation and hate speech continue to propagate

Externí odkaz: http://arxiv.org/abs/2404.01822

Zobrazit plný text záznamu

Report

Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model

Autor: Chintam, Abhijith, Beloch, Rahel, Zuidema, Willem, Hanna, Michael, van der Wal, Oskar

Language models (LMs) exhibit and amplify many types of undesirable biases learned from the training data, including gender bias. However, we lack tools for effectively and efficiently changing this behavior without hurting general language modeling

Externí odkaz: http://arxiv.org/abs/2310.12611

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání