Zobrazeno 1 - 10
of 46 722
pro vyhledávání: '"Spector, A. A."'
Autor:
Zhang, Michael, Arora, Simran, Chalamala, Rahul, Wu, Alan, Spector, Benjamin, Singhal, Aaryan, Ramesh, Krithik, Ré, Christopher
Recent works show we can linearize large language models (LLMs) -- swapping the quadratic attentions of popular Transformer-based LLMs with subquadratic analogs, such as linear attention -- avoiding the expensive pretraining costs. However, linearizi
Externí odkaz:
http://arxiv.org/abs/2410.10254
This paper illustrates a further application of topological data analysis to the study of self-organising models for chemical and biological systems. In particular, we investigate whether topological summaries can capture the parameter dependence of
Externí odkaz:
http://arxiv.org/abs/2409.20491
Autor:
Domínguez, Oscar, Spector, Daniel
A central question which originates in the celebrated work in the 1980's of DiPerna and Majda asks what is the optimal decay $f > 0$ such that uniform rates $|\omega|(Q) \leq f(|Q|)$ of the vorticity maximal functions guarantee strong convergence wit
Externí odkaz:
http://arxiv.org/abs/2409.02344
Autor:
Kozlowski, Todd, Wei, Li-Wei, Spector, Aaron D., Hallal, Ayman, Fraedrich, Henry, Brotherton, Daniel C., Oceano, Isabella, Ejlli, Aldo, Grote, Hartmut, Hollis, Harold, Karan, Kanioar, Mueller, Guido, Tanner, D. B., Willke, Benno, Lindner, Axel
The Regeneration Cavity (RC) is a critical component of the Any Light Particle Search II (ALPS II) experiment. It increases the signal from possible axions and axion-like particles in the experiment by nearly four orders of magnitude. The total round
Externí odkaz:
http://arxiv.org/abs/2408.13218
Autor:
Ein-Dor, Liat, Toledo-Ronen, Orith, Spector, Artem, Gretz, Shai, Dankin, Lena, Halfon, Alon, Katz, Yoav, Slonim, Noam
Prompts are how humans communicate with LLMs. Informative prompts are essential for guiding LLMs to produce the desired output. However, prompt engineering is often tedious and time-consuming, requiring significant expertise, limiting its widespread
Externí odkaz:
http://arxiv.org/abs/2408.04560
Counterexample-driven genetic programming (CDGP) uses specifications provided as formal constraints to generate the training cases used to evaluate evolving programs. It has also been extended to combine formal constraints and user-provided training
Externí odkaz:
http://arxiv.org/abs/2408.12604
Autor:
Halfon, Alon, Gretz, Shai, Arviv, Ofir, Spector, Artem, Toledo-Ronen, Orith, Katz, Yoav, Ein-Dor, Liat, Shmueli-Scheuer, Michal, Slonim, Noam
Fine-tuning Large Language Models (LLMs) is an effective method to enhance their performance on downstream tasks. However, choosing the appropriate setting of tuning hyperparameters (HPs) is a labor-intensive and computationally expensive process. He
Externí odkaz:
http://arxiv.org/abs/2407.18990
In this paper we explore several applications of the recently introduced spaces of functions of bounded $\beta$-dimensional mean oscillation for $\beta \in (0,n]$ to regularity theory of critical exponent elliptic equations. We first show that functi
Externí odkaz:
http://arxiv.org/abs/2407.13884
Autor:
Arora, Simran, Timalsina, Aman, Singhal, Aaryan, Spector, Benjamin, Eyuboglu, Sabri, Zhao, Xinyi, Rao, Ashish, Rudra, Atri, Ré, Christopher
Recurrent large language models that compete with Transformers in language modeling perplexity are emerging at a rapid rate (e.g., Mamba, RWKV). Excitingly, these architectures use a constant amount of memory during inference. However, due to the lim
Externí odkaz:
http://arxiv.org/abs/2407.05483
We establish an approach to trace inequalities for potential-type operators based on an appropriate modification of an interpolation theorem due to Calder\'on. We develop a general theoretical tool for establishing boundedness of notoriously difficul
Externí odkaz:
http://arxiv.org/abs/2407.03986