Zobrazeno 1 - 10
of 30 299
pro vyhledávání: '"A, Ziv"'
Understanding what defines a good representation in large language models (LLMs) is fundamental to both theoretical understanding and practical applications. In this paper, we investigate the quality of intermediate representations in various LLM arc
Externí odkaz:
http://arxiv.org/abs/2412.09563
Accurate uncertainty estimation is crucial for deploying neural networks in risk-sensitive applications such as medical diagnosis. Monte Carlo Dropout is a widely used technique for approximating predictive uncertainty by performing stochastic forwar
Externí odkaz:
http://arxiv.org/abs/2412.07169
We consider a class of optimization problems over stochastic variables where the algorithm can learn information about the value of any variable through a series of costly steps; we model this information acquisition process as a Markov Decision Proc
Externí odkaz:
http://arxiv.org/abs/2412.03860
The internal structure of Jupiter is constrained by the precise gravity field measurements by NASA's Juno mission, atmospheric data from the Galileo entry probe, and Voyager radio occultations. Not only are these observations few compared to the poss
Externí odkaz:
http://arxiv.org/abs/2412.01611
Autor:
Green, Matthew J., Ziv, Yoav, Rix, Hans-Walter, Maoz, Dan, Hamoudy, Ikram, Mazeh, Tsevi, Faigler, Simchon, Lam, Marco C., El-Badry, Kareem, Hume, George, Munday, James, Yarker, Paige
Stellar-mass black holes descend from high-mass stars, most of which had stellar binary companions. However, the number of those binary systems that survive the binary evolution and black hole formation is uncertain by multiple orders of magnitude. T
Externí odkaz:
http://arxiv.org/abs/2412.02082
Autor:
Mohseni, Masoud, Scherer, Artur, Johnson, K. Grace, Wertheim, Oded, Otten, Matthew, Aadit, Navid Anjum, Bresniker, Kirk M., Camsari, Kerem Y., Chapman, Barbara, Chatterjee, Soumitra, Dagnew, Gebremedhin A., Esposito, Aniello, Fahim, Farah, Fiorentino, Marco, Khalid, Abdullah, Kong, Xiangzhou, Kulchytskyy, Bohdan, Li, Ruoyu, Lott, P. Aaron, Markov, Igor L., McDermott, Robert F., Pedretti, Giacomo, Gajjar, Archit, Silva, Allyson, Sorebo, John, Spentzouris, Panagiotis, Steiner, Ziv, Torosov, Boyan, Venturelli, Davide, Visser, Robert J., Webb, Zak, Zhan, Xin, Cohen, Yonatan, Ronagh, Pooya, Ho, Alan, Beausoleil, Raymond G., Martinis, John M.
In the span of four decades, quantum computation has evolved from an intellectual curiosity to a potentially realizable technology. Today, small-scale demonstrations have become possible for quantum algorithmic primitives on hundreds of physical qubi
Externí odkaz:
http://arxiv.org/abs/2411.10406
Entropic optimal transport offers a computationally tractable approximation to the classical problem. In this note, we study the approximation rate of the entropic optimal transport map (in approaching the Brenier map) when the regularization paramet
Externí odkaz:
http://arxiv.org/abs/2411.07947
Autor:
Arefin, Md Rifat, Subbaraj, Gopeshh, Gontier, Nicolas, LeCun, Yann, Rish, Irina, Shwartz-Ziv, Ravid, Pal, Christopher
Decoder-only Transformers often struggle with complex reasoning tasks, particularly arithmetic reasoning requiring multiple sequential operations. In this work, we identify representation collapse in the model's intermediate layers as a key factor li
Externí odkaz:
http://arxiv.org/abs/2411.02344
The Gromov-Wasserstein (GW) distance enables comparing metric measure spaces based solely on their internal structure, making it invariant to isomorphic transformations. This property is particularly useful for comparing datasets that naturally admit
Externí odkaz:
http://arxiv.org/abs/2410.18006
Autor:
Scully, Ziv, Doval, Laura
We consider search problems with nonobligatory inspection and single-item or combinatorial selection. A decision maker is presented with a number of items, each of which contains an unknown price, and can pay an inspection cost to observe the item's
Externí odkaz:
http://arxiv.org/abs/2410.19011