Zobrazeno 1 - 10
of 28
pro vyhledávání: '"Lent, Heather"'
Autor:
Tatariya, Kushal, Kulmizev, Artur, Poelman, Wessel, Ploeger, Esther, Bollmann, Marcel, Bjerva, Johannes, Luo, Jiaming, Lent, Heather, de Lhoneux, Miryam
Wikipedia's perceived high quality and broad language coverage have established it as a fundamental resource in multilingual NLP. In the context of low-resource languages, however, these quality assumptions are increasingly being scrutinised. This pa
Externí odkaz:
http://arxiv.org/abs/2411.05527
Despite excellent results on benchmarks over a small subset of languages, large language models struggle to process text from languages situated in `lower-resource' scenarios such as dialects/sociolects (national or social varieties of a language), C
Externí odkaz:
http://arxiv.org/abs/2409.12683
Large Language Models (LLMs) are susceptible to malicious influence by cyber attackers through intrusions such as adversarial, backdoor, and embedding inversion attacks. In response, the burgeoning field of LLM Security aims to study and defend again
Externí odkaz:
http://arxiv.org/abs/2408.11749
Emotion classification is a challenging task in NLP due to the inherent idiosyncratic and subjective nature of linguistic expression, especially with code-mixed data. Pre-trained language models (PLMs) have achieved high performance for many tasks an
Externí odkaz:
http://arxiv.org/abs/2402.03137
Textual data is often represented as real-numbered embeddings in NLP, particularly with the popularity of large language models (LLMs) and Embeddings as a Service (EaaS). However, storing sensitive information as embeddings can be susceptible to secu
Externí odkaz:
http://arxiv.org/abs/2401.12192
Autor:
Lent, Heather, Tatariya, Kushal, Dabre, Raj, Chen, Yiyi, Fekete, Marcell, Ploeger, Esther, Zhou, Li, Armstrong, Ruth-Ann, Eijansantos, Abee, Malau, Catriona, Heje, Hans Erik, Lavrinovics, Ernests, Kanojia, Diptesh, Belony, Paul, Bollmann, Marcel, Grobol, Loïc, de Lhoneux, Miryam, Hershcovich, Daniel, DeGraff, Michel, Søgaard, Anders, Bjerva, Johannes
Creoles represent an under-explored and marginalized group of languages, with few available resources for NLP research.While the genealogical ties between Creoles and a number of highly-resourced languages imply a significant potential for transfer l
Externí odkaz:
http://arxiv.org/abs/2310.19567
We aim to learn language models for Creole languages for which large volumes of data are not readily available, and therefore explore the potential transfer from ancestor languages (the 'Ancestry Transfer Hypothesis'). We find that standard transfer
Externí odkaz:
http://arxiv.org/abs/2206.04371
In recent years, the natural language processing (NLP) community has given increased attention to the disparity of efforts directed towards high-resource languages over low-resource ones. Efforts to remedy this delta often begin with translations of
Externí odkaz:
http://arxiv.org/abs/2206.00437
Autor:
Hershcovich, Daniel, Frank, Stella, Lent, Heather, de Lhoneux, Miryam, Abdou, Mostafa, Brandl, Stephanie, Bugliarello, Emanuele, Piqueras, Laura Cabello, Chalkidis, Ilias, Cui, Ruixiang, Fierro, Constanza, Margatina, Katerina, Rust, Phillip, Søgaard, Anders
Various efforts in the Natural Language Processing (NLP) community have been made to accommodate linguistic diversity and serve speakers of many different languages. However, it is important to acknowledge that speakers and the content they produce a
Externí odkaz:
http://arxiv.org/abs/2203.10020
Creole languages such as Nigerian Pidgin English and Haitian Creole are under-resourced and largely ignored in the NLP literature. Creoles typically result from the fusion of a foreign language with multiple local languages, and what grammatical and
Externí odkaz:
http://arxiv.org/abs/2109.06074