Výsledky vyhledávání - "Soltan, Saleh"

Report

GeMQuAD : Generating Multilingual Question Answering Datasets from Large Language Models using Few Shot Learning

Autor: Namboori, Amani, Mangale, Shivam, Rosenbaum, Andy, Soltan, Saleh

The emergence of Large Language Models (LLMs) with capabilities like In-Context Learning (ICL) has ushered in new possibilities for data generation across various domains while minimizing the need for extensive data collection and modeling techniques

Externí odkaz: http://arxiv.org/abs/2404.09163

Zobrazit plný text záznamu

Report

Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models

Autor: Soltan, Saleh, Rosenbaum, Andy, Falke, Tobias, Lu, Qin, Rumshisky, Anna, Hamza, Wael

Pre-trained encoder-only and sequence-to-sequence (seq2seq) models each have advantages, however training both model types from scratch is computationally expensive. We explore recipes to improve pre-training efficiency by initializing one model from

Externí odkaz: http://arxiv.org/abs/2306.08756

Zobrazit plný text záznamu

Report

CLASP: Few-Shot Cross-Lingual Data Augmentation for Semantic Parsing

Autor: Rosenbaum, Andy, Soltan, Saleh, Hamza, Wael, Saffari, Amir, Damonte, Marco, Groves, Isabel

A bottleneck to developing Semantic Parsing (SP) models is the need for a large volume of human-labeled training data. Given the complexity and cost of human annotation for SP, labeled data is often scarce, particularly in multilingual settings. Larg

Externí odkaz: http://arxiv.org/abs/2210.07074

Zobrazit plný text záznamu

Report

LINGUIST: Language Model Instruction Tuning to Generate Annotated Utterances for Intent Classification and Slot Tagging

Autor: Rosenbaum, Andy, Soltan, Saleh, Hamza, Wael, Versley, Yannick, Boese, Markus

We present LINGUIST, a method for generating annotated data for Intent Classification and Slot Tagging (IC+ST), via fine-tuning AlexaTM 5B, a 5-billion-parameter multilingual sequence-to-sequence (seq2seq) model, on a flexible instruction prompt. In

Externí odkaz: http://arxiv.org/abs/2209.09900

Zobrazit plný text záznamu

Report

AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model

Autor: Soltan, Saleh, Ananthakrishnan, Shankar, FitzGerald, Jack, Gupta, Rahul, Hamza, Wael, Khan, Haidar, Peris, Charith, Rawls, Stephen, Rosenbaum, Andy, Rumshisky, Anna, Prakash, Chandana Satya, Sridhar, Mukund, Triefenbach, Fabian, Verma, Apurv, Tur, Gokhan, Natarajan, Prem

In this work, we demonstrate that multilingual large-scale sequence-to-sequence (seq2seq) models, pre-trained on a mixture of denoising and Causal Language Modeling (CLM) tasks, are more efficient few-shot learners than decoder-only models on various

Externí odkaz: http://arxiv.org/abs/2208.01448

Zobrazit plný text záznamu

Report

Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems

Publikováno v: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

We present results from a large-scale experiment on pretraining encoders with non-embedding parameter counts ranging from 700M to 9.3B, their subsequent distillation into smaller models ranging from 17M-170M parameters, and their application to the N

Externí odkaz: http://arxiv.org/abs/2206.07808

Zobrazit plný text záznamu

Report

Don't Parse, Insert: Multilingual Semantic Parsing with Insertion Based Decoding

Autor: Zhu, Qile, Khan, Haidar, Soltan, Saleh, Rawls, Stephen, Hamza, Wael

Semantic parsing is one of the key components of natural language understanding systems. A successful parse transforms an input utterance to an action that is easily understood by the system. Many algorithms have been proposed to solve this problem,

Externí odkaz: http://arxiv.org/abs/2010.03714

Zobrazit plný text záznamu

Report

Distribution of blackouts in the power grid and the Motter and Lai model

Autor: Kornbluth, Yosef, Cwilich, Gabriel, Buldyrev, Sergey V., Soltan, Saleh, Zussman, Gil

Publikováno v: Phys. Rev. E 103, 032309 (2021)

Carreras, Dobson and colleagues have studied empirical data on the sizes of the blackouts in real grids and modeled them by computer simulations using the direct current approximation. They have found that the resulting blackout sizes are distributed

Externí odkaz: http://arxiv.org/abs/2008.01141

Zobrazit plný text záznamu

Dissertation/ Thesis

Computational and Analytical Tools for Resilient and Secure Power Grids

Autor: Soltan, Saleh

Enhancing power grids' performance and resilience has been one of the greatest challenges in engineering and science over the past decade. A recent report by the National Academies of Sciences, Engineering, and Medicine along with other studies empha

Zobrazit plný text záznamu

Report

Protecting the Grid against IoT Botnets of High-Wattage Devices

Autor: Soltan, Saleh, Mittal, Prateek, Poor, H. Vincent

We provide methods to prevent line failures in the power grid caused by a newly revealed MAnipulation of Demand (MAD) attacks via an IoT botnet of high-wattage devices. In particular, we develop two algorithms named Securing Additional margin For gen

Externí odkaz: http://arxiv.org/abs/1808.03826

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání