Zobrazeno 1 - 10
of 9 888
pro vyhledávání: '"Tiedemann A"'
Autor:
Zhang, Lemei, Liu, Peng, Henriksboe, Marcus Tiedemann Oekland, Lauvrak, Even W., Gulla, Jon Atle, Ramampiaro, Heri
With the rapid advancement of Natural Language Processing in recent years, numerous studies have shown that generic summaries generated by Large Language Models (LLMs) can sometimes surpass those annotated by experts, such as journalists, according t
Externí odkaz:
http://arxiv.org/abs/2410.03905
Autor:
Ji, Shaoxiong, Li, Zihao, Paul, Indraneil, Paavola, Jaakko, Lin, Peiqin, Chen, Pinzhen, O'Brien, Dayyán, Luo, Hengyu, Schütze, Hinrich, Tiedemann, Jörg, Haddow, Barry
In this work, we introduce EMMA-500, a large-scale multilingual language model continue-trained on texts across 546 languages designed for enhanced multilingual performance, focusing on improving language coverage for low-resource languages. To facil
Externí odkaz:
http://arxiv.org/abs/2409.17892
Autor:
Tiedemann, Silvana, Canales, Jorge Sanchez, Schur, Felix, Sgarlato, Raffaele, Hirth, Lion, Ruhnau, Oliver, Peters, Jonas
The price elasticity of demand can be estimated from observational data using instrumental variables (IV). However, naive IV estimators may be inconsistent in settings with autocorrelated time series. We argue that causal time graphs can simplify IV
Externí odkaz:
http://arxiv.org/abs/2409.15530
Pretrained language models (PLMs) display impressive performances and have captured the attention of the NLP community. Establishing best practices in pretraining has, therefore, become a major focus of NLP research, especially since insights gained
Externí odkaz:
http://arxiv.org/abs/2407.15489
Autor:
Häckel, Timo, von Roenn, Luca, Juchmann, Nemo, Fay, Alexander, Akkermans, Rinie, Tiedemann, Tim, Schmidt, Thomas C.
The trend for Urban Air Mobility (UAM) is growing with prospective air taxis, parcel deliverers, and medical and industrial services. Safe and efficient UAM operation relies on timely communication and reliable data exchange. In this paper, we explor
Externí odkaz:
http://arxiv.org/abs/2405.03290
Multilingual pretraining and fine-tuning have remarkably succeeded in various natural language processing tasks. Transferring representations from one language to another is especially crucial for cross-lingual learning. One can expect machine transl
Externí odkaz:
http://arxiv.org/abs/2403.16777
Autor:
de Gibert, Ona, Nail, Graeme, Arefyev, Nikolay, Bañón, Marta, van der Linde, Jelmer, Ji, Shaoxiong, Zaragoza-Bernabeu, Jaume, Aulamo, Mikko, Ramírez-Sánchez, Gema, Kutuzov, Andrey, Pyysalo, Sampo, Oepen, Stephan, Tiedemann, Jörg
We present the HPLT (High Performance Language Technologies) language resources, a new massive multilingual dataset including both monolingual and bilingual corpora extracted from CommonCrawl and previously unused web crawls from the Internet Archive
Externí odkaz:
http://arxiv.org/abs/2403.14009
Autor:
Mickus, Timothee, Zosa, Elaine, Vázquez, Raúl, Vahtola, Teemu, Tiedemann, Jörg, Segonne, Vincent, Raganato, Alessandro, Apidianaki, Marianna
This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations: outputs from natural language generation (NLG) systems that are fluent, yet inaccurate. Such cases of overgeneration put in jeopardy many NLG applicatio
Externí odkaz:
http://arxiv.org/abs/2403.07726
Autor:
Mickus, Timothee, Grönroos, Stig-Arne, Attieh, Joseph, Boggia, Michele, De Gibert, Ona, Ji, Shaoxiong, Lopi, Niki Andreas, Raganato, Alessandro, Vázquez, Raúl, Tiedemann, Jörg
NLP in the age of monolithic large language models is approaching its limits in terms of size and information that can be handled. The trend goes to modularization, a necessary step into the direction of designing smaller sub-networks and components
Externí odkaz:
http://arxiv.org/abs/2403.07544
Large language models (LLMs) have advanced the state of the art in natural language processing. However, their predominant design for English or a limited set of languages creates a substantial gap in their effectiveness for low-resource languages. T
Externí odkaz:
http://arxiv.org/abs/2401.13303