Vicarious Offense and Noise Audit of Offensive Speech Classifiers

Authors: Weerasooriya, Tharindu Cyril; Dutta, Sujan; Ranasinghe, Tharindu; Zampieri, Marcos; Homan, Christopher M.; KhudaBukhsh, Ashiqur R.
Language: English
Year of publication: 2023
Subject:
Description: This paper examines social web content moderation from two key perspectives: automated methods (machine moderators) and human evaluators (human moderators). We conduct a noise audit at an unprecedented scale using nine machine moderators trained on well-known offensive speech data sets and evaluated on a corpus sampled from 92 million YouTube comments discussing a multitude of issues relevant to US politics. We introduce a first-of-its-kind data set of vicarious offense. We ask annotators: (1) whether they find a given social media post offensive; and (2) how offensive annotators sharing different political beliefs would find the same content. Our experiments with machine moderators reveal that moderation outcomes vary wildly across different machine moderators. Our experiments with human moderators suggest that (1) political leanings considerably affect first-person offense perspective; (2) Republicans are the worst predictors of vicarious offense; (3) predicting vicarious offense for the Republicans is more challenging than predicting it for the Independents and the Democrats; and (4) disagreement across political identity groups increases considerably when sensitive issues such as reproductive rights or gun control/rights are discussed. Both experiments suggest that offense is, indeed, highly subjective and raise important questions concerning content moderation practices.
Database: OpenAIRE
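
The "noise audit" described in the abstract amounts to running the same comments through several independently trained offensive-speech classifiers and measuring how often their labels diverge. The following minimal sketch illustrates that idea with toy, hypothetical moderator outputs; the paper's actual nine classifiers, training data sets, and YouTube corpus are not reproduced here.

```python
# Minimal noise-audit sketch: compare labels that several "machine moderators"
# assign to the same comments and quantify their disagreement.
# The moderator names and label vectors below are illustrative placeholders.
from itertools import combinations

# Binary labels (1 = offensive, 0 = not offensive) from three hypothetical
# machine moderators on the same five comments.
predictions = {
    "moderator_A": [1, 0, 1, 1, 0],
    "moderator_B": [1, 1, 0, 1, 0],
    "moderator_C": [0, 1, 1, 1, 0],
}

def pairwise_disagreement(p, q):
    """Fraction of comments on which two moderators assign different labels."""
    return sum(a != b for a, b in zip(p, q)) / len(p)

for (name_p, p), (name_q, q) in combinations(predictions.items(), 2):
    print(f"{name_p} vs {name_q}: disagreement = {pairwise_disagreement(p, q):.2f}")

# Comments on which the moderators are not unanimous are the "noisy" cases
# that such an audit surfaces for closer (e.g., human) inspection.
unanimous = [len(set(labels)) == 1 for labels in zip(*predictions.values())]
print("comments with unanimous labels:", sum(unanimous), "of", len(unanimous))
```

In practice the label vectors would come from real classifiers scoring the sampled comment corpus; the disagreement statistics then show how strongly moderation outcomes depend on which machine moderator is deployed.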