Showing 1 - 10 of 9,811 for search: '"Tiedemann, A"'
Pretrained language models (PLMs) display impressive performances and have captured the attention of the NLP community. Establishing the best practices in pretraining has therefore become a major point of focus for much of NLP research -- especially
External link:
http://arxiv.org/abs/2407.15489
Authors:
Häckel, Timo, von Roenn, Luca, Juchmann, Nemo, Fay, Alexander, Akkermans, Rinie, Tiedemann, Tim, Schmidt, Thomas C.
The trend for Urban Air Mobility (UAM) is growing with prospective air taxis, parcel deliverers, and medical and industrial services. Safe and efficient UAM operation relies on timely communication and reliable data exchange. In this paper, we explor
External link:
http://arxiv.org/abs/2405.03290
Multilingual pretraining and fine-tuning have remarkably succeeded in various natural language processing tasks. Transferring representations from one language to another is especially crucial for cross-lingual learning. One can expect machine transl
External link:
http://arxiv.org/abs/2403.16777
Authors:
de Gibert, Ona, Nail, Graeme, Arefyev, Nikolay, Bañón, Marta, van der Linde, Jelmer, Ji, Shaoxiong, Zaragoza-Bernabeu, Jaume, Aulamo, Mikko, Ramírez-Sánchez, Gema, Kutuzov, Andrey, Pyysalo, Sampo, Oepen, Stephan, Tiedemann, Jörg
We present the HPLT (High Performance Language Technologies) language resources, a new massive multilingual dataset including both monolingual and bilingual corpora extracted from CommonCrawl and previously unused web crawls from the Internet Archive
External link:
http://arxiv.org/abs/2403.14009
Authors:
Mickus, Timothee, Zosa, Elaine, Vázquez, Raúl, Vahtola, Teemu, Tiedemann, Jörg, Segonne, Vincent, Raganato, Alessandro, Apidianaki, Marianna
This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations: outputs from natural language generation (NLG) systems that are fluent, yet inaccurate. Such cases of overgeneration put in jeopardy many NLG applicatio
External link:
http://arxiv.org/abs/2403.07726
Authors:
Mickus, Timothee, Grönroos, Stig-Arne, Attieh, Joseph, Boggia, Michele, De Gibert, Ona, Ji, Shaoxiong, Lopi, Niki Andreas, Raganato, Alessandro, Vázquez, Raúl, Tiedemann, Jörg
NLP in the age of monolithic large language models is approaching its limits in terms of size and information that can be handled. The trend goes to modularization, a necessary step into the direction of designing smaller sub-networks and components
External link:
http://arxiv.org/abs/2403.07544
Large language models (LLMs) have advanced the state of the art in natural language processing. However, their predominant design for English or a limited set of languages creates a substantial gap in their effectiveness for low-resource languages. T
External link:
http://arxiv.org/abs/2401.13303
Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information it contains, e.g., names or political opinions. General Data Protection Regulat
External link:
http://arxiv.org/abs/2308.16109
Authors:
Ralph Tiedemann, Rüdiger Riesch, Maxi Tomowski, Katja Havenstein, Jan Schlupp, Waldir Miron Berbel-Filho, Ingo Schlupp
Published in:
BMC Ecology and Evolution, Vol 24, Iss 1, Pp 1-20 (2024)
Widespread species often experience significant environmental clines over the area they naturally occupy. We investigated a widespread livebearing fish, the Sailfin molly (Poecilia latipinna), combining genetic, life-history, and environmenta
External link:
https://doaj.org/article/ff841b1cc54743d1b6262c1c27fe3797
This paper examines empirical methods for estimating the response of aggregated electricity demand to high-frequency price signals, the short-term elasticity of electricity demand. We investigate how the endogeneity of prices and the autocorrelation
External link:
http://arxiv.org/abs/2306.12863