Showing 1 - 10 of 19
for search: '"Campos, Jon Ander"'
Author:
Sainz, Oscar, García-Ferrero, Iker, Jacovi, Alon, Campos, Jon Ander, Elazar, Yanai, Agirre, Eneko, Goldberg, Yoav, Chen, Wei-Lin, Chim, Jenny, Choshen, Leshem, D'Amico-Wong, Luca, Dell, Melissa, Fan, Run-Ze, Golchin, Shahriar, Li, Yucheng, Liu, Pengfei, Pahwa, Bhavish, Prabhu, Ameya, Sharma, Suryansh, Silcock, Emily, Solonko, Kateryna, Stap, David, Surdeanu, Mihai, Tseng, Yu-Min, Udandarao, Vishaal, Wang, Zengzhi, Xu, Ruijie, Yang, Jinglin
The 1st Workshop on Data Contamination (CONDA 2024) focuses on all relevant aspects of data contamination in natural language processing, where data contamination is understood as situations where evaluation data is included in pre-training corpora…
External link:
http://arxiv.org/abs/2407.21530
Author:
Ye, Zihuiwen, Greenlee-Scott, Fraser, Bartolo, Max, Blunsom, Phil, Campos, Jon Ander, Gallé, Matthias
Reward models (RMs) play a critical role in aligning language models through the process of reinforcement learning from human feedback. RMs are trained to predict a score reflecting human preference, which requires significant time and cost for human…
External link:
http://arxiv.org/abs/2405.20850
Author:
Aryabumi, Viraat, Dang, John, Talupuru, Dwarak, Dash, Saurabh, Cairuz, David, Lin, Hangyu, Venkitesh, Bharat, Smith, Madeline, Campos, Jon Ander, Tan, Yi Chern, Marchisio, Kelly, Bartolo, Max, Ruder, Sebastian, Locatelli, Acyr, Kreutzer, Julia, Frosst, Nick, Gomez, Aidan, Blunsom, Phil, Fadaee, Marzieh, Üstün, Ahmet, Hooker, Sara
This technical report introduces Aya 23, a family of multilingual language models. Aya 23 builds on the recent release of the Aya model (Üstün et al., 2024), focusing on pairing a highly performant pre-trained model with the recently released Aya…
External link:
http://arxiv.org/abs/2405.15032
In this paper, we demonstrate how Large Language Models (LLMs) can effectively learn to use an off-the-shelf information retrieval (IR) system specifically when additional context is required to answer a given question. Given the performance of IR systems…
External link:
http://arxiv.org/abs/2404.19705
Author:
Sainz, Oscar, Campos, Jon Ander, García-Ferrero, Iker, Etxaniz, Julen, de Lacalle, Oier Lopez, Agirre, Eneko
In this position paper, we argue that the classical evaluation on Natural Language Processing (NLP) tasks using annotated benchmarks is in trouble. The worst kind of data contamination happens when a Large Language Model (LLM) is trained on the test…
External link:
http://arxiv.org/abs/2310.18018
Neural information retrieval requires costly annotated data for each target domain to be competitive. Synthetic annotation by query generation using Large Language Models or rule-based string manipulation has been proposed as an alternative, but their…
External link:
http://arxiv.org/abs/2310.09350
Named Entity Recognition (NER) is a core natural language processing task in which pre-trained language models have shown remarkable performance. However, standard benchmarks like CoNLL 2003 do not address many of the challenges that deployed NER systems…
External link:
http://arxiv.org/abs/2304.10637
Author:
Scheurer, Jérémy, Campos, Jon Ander, Korbak, Tomasz, Chan, Jun Shern, Chen, Angelica, Cho, Kyunghyun, Perez, Ethan
Pretrained language models often generate outputs that are not in line with human preferences, such as harmful text or factually incorrect summaries. Recent work approaches the above issues by learning from a simple form of human feedback: comparison…
External link:
http://arxiv.org/abs/2303.16755
Author:
Chen, Angelica, Scheurer, Jérémy, Korbak, Tomasz, Campos, Jon Ander, Chan, Jun Shern, Bowman, Samuel R., Cho, Kyunghyun, Perez, Ethan
The potential for pre-trained large language models (LLMs) to use natural language feedback at inference time has been an exciting recent development. We build upon this observation by formalizing an algorithm for learning from natural language feedback…
External link:
http://arxiv.org/abs/2303.16749
Author:
Scheurer, Jérémy, Campos, Jon Ander, Chan, Jun Shern, Chen, Angelica, Cho, Kyunghyun, Perez, Ethan
Pretrained language models often do not perform tasks in ways that are in line with our preferences, e.g., generating offensive text or factually incorrect summaries. Recent work approaches the above issue by learning from a simple form of human eval…
External link:
http://arxiv.org/abs/2204.14146