Showing 1 - 10 of 246 for search: '"Bach, Stephen"'
Many recent works have explored using language models for planning problems. One line of research focuses on translating natural language descriptions of planning tasks into structured planning languages, such as the planning domain definition language …
External link:
http://arxiv.org/abs/2407.03321
Detoxifying multilingual Large Language Models (LLMs) has become crucial due to their increasing global use. In this work, we explore zero-shot cross-lingual generalization of preference tuning in detoxifying LLMs. Unlike previous studies that show …
External link:
http://arxiv.org/abs/2406.16235
Recent works often assume that Vision-Language Model (VLM) representations are based on visual attributes like shape. However, it is unclear to what extent VLMs prioritize this information to represent concepts. We propose Extract and Explore (EX2) …
External link:
http://arxiv.org/abs/2403.16442
We introduce Bonito, an open-source model for conditional task generation that converts unannotated text into task-specific training datasets for instruction tuning. We aim to enable zero-shot task adaptation of large language models on users' specialized …
External link:
http://arxiv.org/abs/2402.18334
Data scarcity in low-resource languages can be addressed with word-to-word translations from labeled task data in high-resource languages using bilingual lexicons. However, bilingual lexicons often have limited lexical overlap with task data, which …
External link:
http://arxiv.org/abs/2402.14086
Prompted weak supervision (PromptedWS) applies pre-trained large language models (LLMs) as the basis for labeling functions (LFs) in a weak supervision framework to obtain large labeled datasets. We further extend the use of LLMs in the loop to address …
External link:
http://arxiv.org/abs/2402.01867
Author:
Esfandiarpoor, Reza; Bach, Stephen H.
A promising approach for improving the performance of vision-language models like CLIP for image classification is to extend the class descriptions (i.e., prompts) with related attributes, e.g., using brown sparrow instead of sparrow. However, current …
External link:
http://arxiv.org/abs/2311.07593
AI safety training and red-teaming of large language models (LLMs) are measures to mitigate the generation of unsafe content. Our work exposes the inherent cross-lingual vulnerability of these safety mechanisms, resulting from the linguistic inequality …
External link:
http://arxiv.org/abs/2310.02446
Fine-tuning vision-language models (VLMs) like CLIP to downstream tasks is often necessary to optimize their performance. However, a major obstacle is the limited availability of labeled data. We study the use of pseudolabels, i.e., heuristic labels …
External link:
http://arxiv.org/abs/2306.01669
We introduce an adaptive method with formal quality guarantees for weak supervision in a non-stationary setting. Our goal is to infer the unknown labels of a sequence of data by using weak supervision sources that provide independent noisy signals of …
External link:
http://arxiv.org/abs/2306.01658