Search results

Report

The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

Authors: Michael Hassid, Tal Remez, Jonas Gehring, Roy Schwartz, Yossi Adi

It is a common belief that large language models (LLMs) are better than smaller-sized ones. However, larger models also require significantly more time and compute during inference. This begs the question: what happens when both models operate under

External link: http://arxiv.org/abs/2404.00725

Report

On the Semantic Latent Space of Diffusion-Based Text-to-Speech Models

Authors: Miri Varshavsky-Hassid, Roy Hirsch, Regev Cohen, Tomer Golany, Daniel Freedman, Ehud Rivlin

The incorporation of Denoising Diffusion Models (DDMs) in the Text-to-Speech (TTS) domain is rising, providing great value in synthesizing high quality speech. Although they exhibit impressive audio quality, the extent of their semantic capabilities

External link: http://arxiv.org/abs/2402.12423

Report

Transformers are Multi-State RNNs

Authors: Matanel Oren, Michael Hassid, Nir Yarden, Yossi Adi, Roy Schwartz

Transformers are considered conceptually different from the previous generation of state-of-the-art NLP models - recurrent neural networks (RNNs). In this work, we demonstrate that decoder-only transformers can in fact be conceptualized as unbounded

External link: http://arxiv.org/abs/2401.06104

Academic article

Civilian knowledge industries and the ascendance of small and medium-sized states in world politics

Authors: Nir Hassid, Eviatar Matania

Published in: Humanities & Social Sciences Communications, Vol 11, Iss 1, Pp 1-10 (2024)

Abstract: In the evolving landscape of international politics, the ascent of small and medium-sized (SMS) states in knowledge industries is notable. As these states, exemplified by Israel, Sweden, Singapore, and the United Arab Emirates, harness advan

External link: https://doaj.org/article/7c8410f5e2164f28bc430c07bdf570c9

Report

EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis

Authors: Tu Anh Nguyen, Wei-Ning Hsu, Antony D'Avirro, Bowen Shi, Itai Gat, Maryam Fazel-Zarani, Tal Remez, Jade Copet, Gabriel Synnaeve, Michael Hassid, Felix Kreuk, Yossi Adi, Emmanuel Dupoux

Recent work has shown that it is possible to resynthesize high-quality speech based, not on text, but on low bitrate discrete units that have been learned in a self-supervised fashion and can therefore capture expressive aspects of speech that are ha

External link: http://arxiv.org/abs/2308.05725

Report

Finding the SWEET Spot: Analysis and Improvement of Adaptive Inference in Low Resource Settings

Authors: Daniel Rotem, Michael Hassid, Jonathan Mamou, Roy Schwartz

Adaptive inference is a simple method for reducing inference costs. The method works by maintaining multiple classifiers of different capacities, and allocating resources to each test instance according to its difficulty. In this work, we compare the

External link: http://arxiv.org/abs/2306.02307

Report

Textually Pretrained Speech Language Models

Authors: Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Defossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi

Speech language models (SpeechLMs) process and generate acoustic data only, without textual supervision. In this work, we propose TWIST, a method for training SpeechLMs using a warm-start from a pretrained textual language model. We show using both

External link: http://arxiv.org/abs/2305.13009

Report

How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers

Authors: Michael Hassid, Hao Peng, Daniel Rotem, Jungo Kasai, Ivan Montero, Noah A. Smith, Roy Schwartz

The attention mechanism is considered the backbone of the widely-used Transformer architecture. It contextualizes the input by computing input-specific attention matrices. We find that this mechanism, while powerful and elegant, is not as important a

External link: http://arxiv.org/abs/2211.03495

Report

Efficient Methods for Natural Language Processing: A Survey

Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources include data,

External link: http://arxiv.org/abs/2209.00099

Academic article

Low Ventricular Stiffness Is Associated With Suboptimal Outcomes in Patients With a Single Right Ventricle After the Fontan Operation: A Novel Phenotype

Authors: Shahryar M. Chowdhury, Andrew M. Atz, Eric M. Graham, Varsha M. Bandisode, John F. Rhodes, Arni C. Nutting, Carolyn Taylor, Andrew Savage, Marc Hassid, Minoo Kavarana, Donald Menick

Published in: Journal of the American Heart Association: Cardiovascular and Cerebrovascular Disease, Vol 13, Iss 17 (2024)

Background: Despite a rigorous screening process, including cardiac catheterization, a subset of patients with a single right ventricle (SRV) demonstrates suboptimal short‐term outcomes after the Fontan operation. The goal of this study was to perfo

External link: https://doaj.org/article/955c6fb2c7fc4b1ba0a2016042d2cda7