Search results - "Martins, andré"

Report

A Context-aware Framework for Translation-mediated Conversations

Authors: Pombal, José; Agrawal, Sweta; Fernandes, Patrick; Zaranis, Emmanouil; Martins, André F. T.

Effective communication is fundamental to any interaction, yet challenges arise when participants do not share a common language. Automatic translation systems offer a powerful solution to bridge language barriers in such scenarios, but they introduc

External link: http://arxiv.org/abs/2412.04205

Report

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Cultural biases in multilingual datasets pose significant challenges for their effectiveness as global benchmarks. These biases stem not only from language but also from the cultural knowledge required to interpret questions, reducing the practical u

External link: http://arxiv.org/abs/2412.03304

Report

Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval

Authors: Santos, Saul; Niculae, Vlad; McNamee, Daniel; Martins, André F. T.

Associative memory models, such as Hopfield networks and their modern variants, have garnered renewed interest due to advancements in memory capacity and connections with self-attention in transformers. In this work, we introduce a unified framework-

External link: http://arxiv.org/abs/2411.08590

Report

Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings

Authors: Ramos, Miguel Moura; Almeida, Tomás; Vareta, Daniel; Azevedo, Filipe; Agrawal, Sweta; Fernandes, Patrick; Martins, André F. T.

Reinforcement learning (RL) has been proven to be an effective and robust method for training neural machine translation systems, especially when paired with powerful reward models that accurately assess translation quality. However, most research ha

External link: http://arxiv.org/abs/2411.05986

Report

Analyzing Context Contributions in LLM-based Machine Translation

Authors: Zaranis, Emmanouil; Guerreiro, Nuno M.; Martins, André F. T.

Large language models (LLMs) have achieved state-of-the-art performance in machine translation (MT) and demonstrated the ability to leverage in-context learning through few-shot examples. However, the mechanisms by which LLMs use different parts of t

External link: http://arxiv.org/abs/2410.16246

Report

Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation

Authors: Zaranis, Emmanouil; Attanasio, Giuseppe; Agrawal, Sweta; Martins, André F. T.

The automatic assessment of translation quality has recently become crucial across several stages of the translation pipeline, from data curation to training and decoding. Although quality estimation (QE) metrics have been optimized to align with hum

External link: http://arxiv.org/abs/2410.10995

Report

Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation

Authors: Agrawal, Sweta; de Souza, José G. C.; Rei, Ricardo; Farinhas, António; Faria, Gonçalo; Fernandes, Patrick; Guerreiro, Nuno M; Martins, Andre

Alignment with human preferences is an important step in developing accurate and safe large language models. This is no exception in machine translation (MT), where better handling of language nuances and context-specific variations leads to improved

External link: http://arxiv.org/abs/2410.07779

Report

EuroLLM: Multilingual Language Models for Europe

Authors: Martins, Pedro Henrique; Fernandes, Patrick; Alves, João; Guerreiro, Nuno M.; Rei, Ricardo; Alves, Duarte M.; Pombal, José; Farajian, Amin; Faysse, Manuel; Klimaszewski, Mateusz; Colombo, Pierre; Haddow, Barry; de Souza, José G. C.; Birch, Alexandra; Martins, André F. T.

The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs capable of understanding

External link: http://arxiv.org/abs/2409.16235

Report

Reranking Laws for Language Generation: A Communication-Theoretic Perspective

Authors: Farinhas, António; Li, Haau-Sing; Martins, André F. T.

To ensure large language models (LLMs) are used safely, one must reduce their propensity to hallucinate or to generate unacceptable answers. A simple and often used strategy is to first let the LLM generate multiple hypotheses and then employ a reran

External link: http://arxiv.org/abs/2409.07131

Report

DOCE: Finding the Sweet Spot for Execution-Based Code Generation

Authors: Li, Haau-Sing; Fernandes, Patrick; Gurevych, Iryna; Martins, André F. T.

Recently, a diverse set of decoding and reranking procedures have been shown effective for LLM-based code generation. However, a comprehensive framework that links and experimentally compares these methods is missing. We address this by proposing Dec

External link: http://arxiv.org/abs/2408.13745
