Showing 1 - 10 of 399 for search: '"Rostamizadeh P"'
Author:
Rawat, Ankit Singh, Sadhanala, Veeranjaneyulu, Rostamizadeh, Afshin, Chakrabarti, Ayan, Jitkrittum, Wittawat, Feinberg, Vladimir, Kim, Seungyeon, Harutyunyan, Hrayr, Saunshi, Nikunj, Nado, Zachary, Shivanna, Rakesh, Reddi, Sashank J., Menon, Aditya Krishna, Anil, Rohan, Kumar, Sanjiv
A primary challenge in large language model (LLM) development is their onerous pre-training cost. Typically, such pre-training involves optimizing a self-supervised objective (such as next-token prediction) over a large corpus. This paper explores…
External link:
http://arxiv.org/abs/2410.18779
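Where the abstract mentions "optimizing a self-supervised objective (such as next-token prediction)", the following toy sketch shows what that objective computes; all names, shapes, and the random "model" scores are illustrative, not code from the paper.

import numpy as np

def next_token_loss(logits: np.ndarray, tokens: np.ndarray) -> float:
    """Average cross-entropy of predicting tokens[t+1] from position t.

    logits: (T, V) array of unnormalized scores, one row per position.
    tokens: (T,) array of token ids.
    """
    # Shift by one: the logits at position t are scored against token t+1.
    logits, targets = logits[:-1], tokens[1:]
    # Log-softmax, computed stably by subtracting the row max first.
    logits = logits - logits.max(axis=-1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=-1, keepdims=True))
    return float(-log_probs[np.arange(len(targets)), targets].mean())

# Toy usage: random "model" scores over a 10-token vocabulary.
rng = np.random.default_rng(0)
print(next_token_loss(rng.normal(size=(8, 10)), rng.integers(0, 10, size=8)))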
We present a novel soft prompt based framework, SoftSRV, that leverages a frozen pre-trained large language model (LLM) to generate targeted synthetic text sequences. Given a sample from the target distribution, our proposed framework uses data-driven…
External link:
http://arxiv.org/abs/2410.16534
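As a rough sketch of the soft-prompt idea the abstract refers to (assuming the common setup, not the paper's exact design): a small matrix of trainable continuous embeddings is prepended to the embedded input of a frozen LLM, and only those prompt parameters are updated. The names and dimensions below are hypothetical.

import numpy as np

D_MODEL, PROMPT_LEN = 16, 4

# Trainable soft prompt: a few continuous embedding vectors. Only these
# parameters would be optimized; the LLM's own weights stay frozen.
soft_prompt = np.random.default_rng(0).normal(scale=0.02,
                                              size=(PROMPT_LEN, D_MODEL))

def with_soft_prompt(token_embeddings: np.ndarray) -> np.ndarray:
    """Prepend the soft prompt to a (seq_len, d_model) embedding matrix."""
    return np.concatenate([soft_prompt, token_embeddings], axis=0)

# Usage: a frozen model would consume the extended embedding sequence.
x = np.zeros((10, D_MODEL))       # stand-in for embedded input tokens
print(with_soft_prompt(x).shape)  # (14, 16): prompt_len + seq_len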
Author:
Ye, Ke, Jiang, Heinrich, Rostamizadeh, Afshin, Chakrabarti, Ayan, DeSalvo, Giulia, Kagy, Jean-François, Karydas, Lazaros, Citovsky, Gui, Kumar, Sanjiv
Pre-training large language models is known to be extremely resource intensive and oftentimes inefficient, under-utilizing the information encapsulated in the training text sequences. In this paper, we present SpacTor, a new training procedure…
External link:
http://arxiv.org/abs/2401.13160
Author:
Zhou, Yongchao, Lyu, Kaifeng, Rawat, Ankit Singh, Menon, Aditya Krishna, Rostamizadeh, Afshin, Kumar, Sanjiv, Kagy, Jean-François, Agarwal, Rishabh
Speculative decoding (SD) accelerates large language model inference by employing a faster draft model to generate multiple tokens, which are then verified in parallel by the larger target model, resulting in text generated according to the target…
External link:
http://arxiv.org/abs/2310.08461
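A simplified, greedy sketch of the speculative decoding loop described in the snippet. Real SD verifies draft tokens with a rejection-sampling rule that provably preserves the target model's distribution; this toy version only accepts draft tokens that match the target model's greedy choice. draft_next and target_next are hypothetical stand-ins for the two models.

from typing import Callable, List

def speculative_step(prefix: List[int],
                     draft_next: Callable[[List[int]], int],
                     target_next: Callable[[List[int]], int],
                     k: int = 4) -> List[int]:
    """Extend prefix by up to k draft tokens, verified by the target model."""
    # 1) The cheap draft model proposes k tokens autoregressively.
    proposal = list(prefix)
    for _ in range(k):
        proposal.append(draft_next(proposal))
    # 2) The target model checks each proposed token. In a real system this
    #    verification is one batched forward pass, which is where the speedup
    #    comes from; here it is a plain loop for clarity.
    out = list(prefix)
    for t in proposal[len(prefix):]:
        v = target_next(out)      # the target's own next token here
        if v == t:
            out.append(t)         # draft token accepted
        else:
            out.append(v)         # target corrects the draft; stop this round
            break
    return out

# Toy usage: the "target" counts 0,1,2,... and the draft agrees only for the
# first few positions.
target = lambda seq: len(seq) % 5
draft = lambda seq: len(seq) % 5 if len(seq) < 3 else 99
print(speculative_step([0], draft, target))   # -> [0, 1, 2, 3]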
Author:
Citovsky, Gui, DeSalvo, Giulia, Kumar, Sanjiv, Ramalingam, Srikumar, Rostamizadeh, Afshin, Wang, Yunjuan
We present a subset selection algorithm designed to work with arbitrary model families in a practical batch setting. In such a setting, an algorithm can sample examples one at a time but, in order to limit overhead costs, is only able to update its state…
External link:
http://arxiv.org/abs/2301.12052
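A generic sketch of the batch setting the snippet describes, under the assumption that "state" means whatever expensive per-model bookkeeping the selector maintains: examples are scored one at a time against a possibly stale state, and the state is refreshed only once per selected batch. This illustrates the setting, not the paper's algorithm.

def select_subset(stream, score_fn, update_state, state,
                  window: int = 4, keep: int = 2):
    """Scan the stream in windows of `window` examples; keep the top-`keep`
    scored examples per window, updating the expensive state only once per
    selected batch."""
    selected, buf = [], []
    for example in stream:
        buf.append(example)
        if len(buf) == window:
            # Rank the window using the current (possibly stale) state.
            buf.sort(key=lambda ex: score_fn(state, ex), reverse=True)
            batch = buf[:keep]
            selected.extend(batch)
            state = update_state(state, batch)  # one update per batch
            buf = []
    return selected

# Toy usage: the "state" is a running mean and we prefer examples far from it.
print(select_subset(range(8),
                    score_fn=lambda mu, x: abs(x - mu),
                    update_state=lambda mu, batch: sum(batch) / len(batch),
                    state=0.0))                 # -> [3, 2, 7, 6]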
Given a labeled training set and a collection of unlabeled data, the goal of active learning (AL) is to identify the best unlabeled points to label. In this comprehensive study, we analyze the performance of a variety of AL algorithms on deep neural networks…
External link:
http://arxiv.org/abs/2210.03822
Academic article
This result cannot be displayed to unauthenticated users. Sign in to view it.
Author:
Citovsky, Gui, DeSalvo, Giulia, Gentile, Claudio, Karydas, Lazaros, Rajagopalan, Anand, Rostamizadeh, Afshin, Kumar, Sanjiv
The ability to train complex and highly effective models often requires an abundance of training data, which can easily become a bottleneck in cost, time, and computational resources. Batch active learning, which adaptively issues batched queries…
External link:
http://arxiv.org/abs/2107.14263
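One common heuristic for the batched-query setting (not necessarily this paper's algorithm): pick uncertain points that are spread out, so a single batch does not waste queries on near-duplicates. A toy 1-D version, with hypothetical inputs:

import numpy as np

def batch_query(pool: np.ndarray, uncertainty: np.ndarray,
                batch_size: int) -> np.ndarray:
    """Pick a batch of uncertain points spread across crude value bins."""
    # Crude "clustering": bin the 1-D pool into batch_size buckets.
    edges = np.linspace(pool.min(), pool.max(), batch_size)
    bins = np.digitize(pool, edges)
    picks = []
    for b in np.unique(bins):
        members = np.where(bins == b)[0]
        # Most uncertain example within each bin.
        picks.append(members[np.argmax(uncertainty[members])])
    return np.array(picks[:batch_size])

rng = np.random.default_rng(1)
pool = rng.uniform(size=20)   # unlabeled pool (1-D features for the toy)
unc = rng.uniform(size=20)    # stand-in for per-example model uncertainty
print(batch_query(pool, unc, batch_size=4))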
Published in:
ICLR 2022
In real-world systems, models are frequently updated as more data becomes available, and in addition to achieving high accuracy, the goal is also to maintain a low difference in predictions compared to the base model (i.e., predictive "churn")…
External link:
http://arxiv.org/abs/2106.02654
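The snippet defines predictive "churn" as the difference in predictions between an updated model and the base model; the simplest version of that metric is the fraction of examples whose predicted label changes. Function and variable names below are illustrative.

import numpy as np

def churn(base_preds: np.ndarray, new_preds: np.ndarray) -> float:
    """Fraction of examples whose predicted label changed after the update."""
    return float(np.mean(base_preds != new_preds))

base = np.array([0, 1, 1, 0, 2, 2])
new = np.array([0, 1, 0, 0, 2, 1])  # updated model flips two predictions
print(churn(base, new))             # 0.333...: 2 of 6 predictions changed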
Author:
Jiang, Heinrich, Rostamizadeh, Afshin
We analyze the problem of active covering, where the learner is given an unlabeled dataset and can sequentially issue label queries on examples. The objective is to label all of the positive examples in the fewest number of total label queries. We show…
External link:
http://arxiv.org/abs/2106.02552
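A toy rendering of the active covering setup just described: query points one at a time until every positive is labeled, expanding around positives already found. The greedy nearest-neighbor strategy here is purely illustrative, not the strategy analyzed in the paper.

import numpy as np

def active_cover(points: np.ndarray, labels: np.ndarray, seed: int = 0) -> int:
    """Query labels one at a time until every positive point is labeled;
    return the number of label queries used."""
    rng = np.random.default_rng(seed)
    n = len(points)
    queried = np.zeros(n, dtype=bool)
    order = list(rng.permutation(n))  # fallback order for blind exploration
    found, queries = [], 0
    while not queried[labels == 1].all():
        unqueried = [i for i in range(n) if not queried[i]]
        if found:
            # Expand around known positives: query the closest unlabeled point.
            i = min(unqueried,
                    key=lambda j: min(abs(points[j] - points[p]) for p in found))
        else:
            i = next(j for j in order if not queried[j])
        queried[i] = True
        queries += 1
        if labels[i] == 1:
            found.append(i)
    return queries

rng = np.random.default_rng(2)
points = np.sort(rng.uniform(size=12))
labels = (points > 0.7).astype(int)  # positives cluster at one end
print(active_cover(points, labels), "queries to label", labels.sum(), "positives")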