Zobrazeno 1 - 10
of 18
pro vyhledávání: '"Maronikolakis, Antonis"'
Large language models (LLMs) are poised to revolutionize the domain of online fashion retail, enhancing customer experience and discovery of fashion online. LLM-powered conversational agents introduce a new way of discovery by directly interacting wi
Externí odkaz:
http://arxiv.org/abs/2408.08907
Politeness Stereotypes and Attack Vectors: Gender Stereotypes in Japanese and Korean Language Models
In efforts to keep up with the rapid progress and use of large language models, gender bias research is becoming more prevalent in NLP. Non-English bias research, however, is still in its infancy with most work focusing on English. In our work, we st
Externí odkaz:
http://arxiv.org/abs/2306.09752
We introduce HATELEXICON, a lexicon of slurs and targets of hate speech for the countries of Brazil, Germany, India and Kenya, to aid training and interpretability of models. We demonstrate how our lexicon can be used to interpret model predictions,
Externí odkaz:
http://arxiv.org/abs/2304.01890
Humor is a magnetic component in everyday human interactions and communications. Computationally modeling humor enables NLP systems to entertain and engage with users. We investigate the effectiveness of prompting, a new transfer learning paradigm fo
Externí odkaz:
http://arxiv.org/abs/2210.13985
To tackle the rising phenomenon of hate speech, efforts have been made towards data curation and analysis. When it comes to analysis of bias, previous work has focused predominantly on race. In our work, we further investigate bias in hate speech dat
Externí odkaz:
http://arxiv.org/abs/2205.06621
Autor:
Maronikolakis, Antonis, Wisiorek, Axel, Nann, Leah, Jabbar, Haris, Udupa, Sahana, Schuetze, Hinrich
Building on current work on multilingual hate speech (e.g., Ousidhoum et al. (2019)) and hate speech reduction (e.g., Sap et al. (2020)), we present XTREMESPEECH, a new hate speech dataset containing 20,297 social media passages from Brazil, Germany,
Externí odkaz:
http://arxiv.org/abs/2203.11764
In previous work, it has been shown that BERT can adequately align cross-lingual sentences on the word level. Here we investigate whether BERT can also operate as a char-level aligner. The languages examined are English, Fake-English, German and Gree
Externí odkaz:
http://arxiv.org/abs/2109.09700
The size of the vocabulary is a central design choice in large pretrained language models, with respect to both performance and memory requirements. Typically, subword tokenization algorithms such as byte pair encoding and WordPiece are used. In this
Externí odkaz:
http://arxiv.org/abs/2109.05772
False information spread via the internet and social media influences public opinion and user activity, while generative models enable fake content to be generated faster and more cheaply than had previously been possible. In the not so distant futur
Externí odkaz:
http://arxiv.org/abs/2009.13375
Parody is a figurative device used to imitate an entity for comedic or critical purposes and represents a widespread phenomenon in social media through many popular parody accounts. In this paper, we present the first computational study of parody. W
Externí odkaz:
http://arxiv.org/abs/2004.13878