Search results - "Spector, A. A."

Report

LoLCATs: On Low-Rank Linearizing of Large Language Models

Authors: Zhang, Michael; Arora, Simran; Chalamala, Rahul; Wu, Alan; Spector, Benjamin; Singhal, Aaryan; Ramesh, Krithik; Ré, Christopher

Recent works show we can linearize large language models (LLMs) -- swapping the quadratic attentions of popular Transformer-based LLMs with subquadratic analogs, such as linear attention -- avoiding the expensive pretraining costs. However, linearizi

External link: http://arxiv.org/abs/2410.10254

Report

Persistent homology classifies parameter dependence of patterns in Turing systems

Authors: Spector, Reemon; Harrington, Heather A.; Gaffney, Eamonn A.

This paper illustrates a further application of topological data analysis to the study of self-organising models for chemical and biological systems. In particular, we investigate whether topological summaries can capture the parameter dependence of

External link: http://arxiv.org/abs/2409.20491

Report

A sparse resolution of the DiPerna-Majda gap problem for $2$D Euler equations

Authors: Domínguez, Oscar; Spector, Daniel

A central question which originates in the celebrated work in the 1980's of DiPerna and Majda asks what is the optimal decay $f > 0$ such that uniform rates $|\omega|(Q) \leq f(|Q|)$ of the vorticity maximal functions guarantee strong convergence wit

External link: http://arxiv.org/abs/2409.02344

Report

Design and Performance of the ALPS II Regeneration Cavity

Authors: Kozlowski, Todd; Wei, Li-Wei; Spector, Aaron D.; Hallal, Ayman; Fraedrich, Henry; Brotherton, Daniel C.; Oceano, Isabella; Ejlli, Aldo; Grote, Hartmut; Hollis, Harold; Karan, Kanioar; Mueller, Guido; Tanner, D. B.; Willke, Benno; Lindner, Axel

The Regeneration Cavity (RC) is a critical component of the Any Light Particle Search II (ALPS II) experiment. It increases the signal from possible axions and axion-like particles in the experiment by nearly four orders of magnitude. The total round

External link: http://arxiv.org/abs/2408.13218

Report

Conversational Prompt Engineering

Authors: Ein-Dor, Liat; Toledo-Ronen, Orith; Spector, Artem; Gretz, Shai; Dankin, Lena; Halfon, Alon; Katz, Yoav; Slonim, Noam

Prompts are how humans communicate with LLMs. Informative prompts are essential for guiding LLMs to produce the desired output. However, prompt engineering is often tedious and time-consuming, requiring significant expertise, limiting its widespread

External link: http://arxiv.org/abs/2408.04560

Report

Generational Computation Reduction in Informal Counterexample-Driven Genetic Programming

Authors: Helmuth, Thomas; Pantridge, Edward; Frazier, James Gunder; Spector, Lee

Counterexample-driven genetic programming (CDGP) uses specifications provided as formal constraints to generate the training cases used to evaluate evolving programs. It has also been extended to combine formal constraints and user-provided training

External link: http://arxiv.org/abs/2408.12604

Report

Stay Tuned: An Empirical Study of the Impact of Hyperparameters on LLM Tuning in Real-World Applications

Authors: Halfon, Alon; Gretz, Shai; Arviv, Ofir; Spector, Artem; Toledo-Ronen, Orith; Katz, Yoav; Ein-Dor, Liat; Shmueli-Scheuer, Michal; Slonim, Noam

Fine-tuning Large Language Models (LLMs) is an effective method to enhance their performance on downstream tasks. However, choosing the appropriate setting of tuning hyperparameters (HPs) is a labor-intensive and computationally expensive process. He

External link: http://arxiv.org/abs/2407.18990

Report

$BMO$ and gradient estimates for solutions of critical elliptic equations

Authors: Chen, You-Wei Benson; Manfredi, Juan; Spector, Daniel

In this paper we explore several applications of the recently introduced spaces of functions of bounded $\beta$-dimensional mean oscillation for $\beta \in (0,n]$ to regularity theory of critical exponent elliptic equations. We first show that functi

External link: http://arxiv.org/abs/2407.13884

Report

Just read twice: closing the recall gap for recurrent language models

Authors: Arora, Simran; Timalsina, Aman; Singhal, Aaryan; Spector, Benjamin; Eyuboglu, Sabri; Zhao, Xinyi; Rao, Ashish; Rudra, Atri; Ré, Christopher

Recurrent large language models that compete with Transformers in language modeling perplexity are emerging at a rapid rate (e.g., Mamba, RWKV). Excitingly, these architectures use a constant amount of memory during inference. However, due to the lim

External link: http://arxiv.org/abs/2407.05483

Report

Potential trace inequalities via a Calderón-type theorem

Authors: Mihula, Zdeněk; Pick, Luboš; Spector, Daniel

We establish an approach to trace inequalities for potential-type operators based on an appropriate modification of an interpolation theorem due to Calderón. We develop a general theoretical tool for establishing boundedness of notoriously difficul

External link: http://arxiv.org/abs/2407.03986
