Zobrazeno 1 - 10
of 330
pro vyhledávání: '"Guerreiro, P. M."'
Large language models (LLMs) have achieved state-of-the-art performance in machine translation (MT) and demonstrated the ability to leverage in-context learning through few-shot examples. However, the mechanisms by which LLMs use different parts of t
Externí odkaz:
http://arxiv.org/abs/2410.16246
Autor:
Agrawal, Sweta, de Souza, José G. C., Rei, Ricardo, Farinhas, António, Faria, Gonçalo, Fernandes, Patrick, Guerreiro, Nuno M, Martins, Andre
Alignment with human preferences is an important step in developing accurate and safe large language models. This is no exception in machine translation (MT), where better handling of language nuances and context-specific variations leads to improved
Externí odkaz:
http://arxiv.org/abs/2410.07779
Autor:
Gisserot-Boukhlef, Hippolyte, Rei, Ricardo, Malherbe, Emmanuel, Hudelot, Céline, Colombo, Pierre, Guerreiro, Nuno M.
Neural metrics for machine translation (MT) evaluation have become increasingly prominent due to their superior correlation with human judgments compared to traditional lexical metrics. Researchers have therefore utilized neural metrics through quali
Externí odkaz:
http://arxiv.org/abs/2409.20059
Autor:
Martins, Pedro Henrique, Fernandes, Patrick, Alves, João, Guerreiro, Nuno M., Rei, Ricardo, Alves, Duarte M., Pombal, José, Farajian, Amin, Faysse, Manuel, Klimaszewski, Mateusz, Colombo, Pierre, Haddow, Barry, de Souza, José G. C., Birch, Alexandra, Martins, André F. T.
The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs capable of understanding
Externí odkaz:
http://arxiv.org/abs/2409.16235
Autor:
Treviso, Marcos, Guerreiro, Nuno M., Agrawal, Sweta, Rei, Ricardo, Pombal, José, Vaz, Tania, Wu, Helena, Silva, Beatriz, van Stigt, Daan, Martins, André F. T.
While machine translation (MT) systems are achieving increasingly strong performance on benchmarks, they often produce translations with errors and anomalies. Understanding these errors can potentially help improve the translation quality and user ex
Externí odkaz:
http://arxiv.org/abs/2406.19482
Autor:
Ghimire, Sulav, Guerreiro, Gabriel M. G., K., Kanakesh V., Guest, Emerson D., Jensen, Kim H., Yang, Guangya, Wang, Xiongfei
Throughout the past few years, various transmission system operators (TSOs) and research institutes have defined several functional specifications for grid-forming (GFM) converters via grid codes, white papers, and technical documents. These institut
Externí odkaz:
http://arxiv.org/abs/2405.05030
Autor:
Alves, Duarte M., Pombal, José, Guerreiro, Nuno M., Martins, Pedro H., Alves, João, Farajian, Amin, Peters, Ben, Rei, Ricardo, Fernandes, Patrick, Agrawal, Sweta, Colombo, Pierre, de Souza, José G. C., Martins, André F. T.
While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe f
Externí odkaz:
http://arxiv.org/abs/2402.17733
Autor:
Ghimire, Sulav, Kkuni, Kanakesh V., Guerreiro, Gabriel M. G., Guest, Emerson D., Jensen, Kim H., Yang, Guangya
This paper studies control interactions between grid-forming (GFM) converters exhibited by power and frequency oscillations in a weakly connected offshore wind power plant (WPP). Two GFM controls are considered, namely virtual synchronous machine (VS
Externí odkaz:
http://arxiv.org/abs/2402.14317
Hallucinated translations pose significant threats and safety concerns when it comes to the practical deployment of machine translation systems. Previous research works have identified that detectors exhibit complementary performance different detect
Externí odkaz:
http://arxiv.org/abs/2402.13331
Autor:
Faysse, Manuel, Fernandes, Patrick, Guerreiro, Nuno M., Loison, António, Alves, Duarte M., Corro, Caio, Boizard, Nicolas, Alves, João, Rei, Ricardo, Martins, Pedro H., Casademunt, Antoni Bigata, Yvon, François, Martins, André F. T., Viaud, Gautier, Hudelot, Céline, Colombo, Pierre
We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local h
Externí odkaz:
http://arxiv.org/abs/2402.00786