Showing 1 - 10 of 25 651 results for search: '"A, Dhingra"'
Author:
Dhingra, Aviral
Gradient descent is a widely used iterative algorithm for finding local minima in multivariate functions. However, the final iterations often either overshoot the minima or make minimal progress, making it challenging to determine an optimal stopping
External link:
http://arxiv.org/abs/2410.19448
Large Language Models (LLMs) are often augmented with external information as contexts, but this external information can sometimes be inaccurate or even intentionally misleading. We argue that robust LLMs should demonstrate situated faithfulness, dy
External link:
http://arxiv.org/abs/2410.14675
We show that existing evaluations for fake news detection based on conventional sources, such as claims on fact-checking websites, result in an increasing accuracy over time for LLM-based detectors -- even after their knowledge cutoffs. This suggests
External link:
http://arxiv.org/abs/2410.14651
Training-free embedding methods directly leverage pretrained large language models (LLMs) to embed text, bypassing the costly and complex procedure of contrastive learning. Previous training-free embedding methods have mainly focused on optimizing em
External link:
http://arxiv.org/abs/2410.14635
Author:
Ismayilzada, Mete, Circi, Defne, Sälevä, Jonne, Sirin, Hale, Köksal, Abdullatif, Dhingra, Bhuwan, Bosselut, Antoine, van der Plas, Lonneke, Ataman, Duygu
Large language models (LLMs) have demonstrated significant progress in various natural language generation and understanding tasks. However, their linguistic generalization capabilities remain questionable, raising doubts about whether these models l
External link:
http://arxiv.org/abs/2410.12656
The study of low regularity Cauchy data for nonlinear dispersive PDEs has successfully been achieved using modulation spaces $M^{p,q}$ in recent years. In this paper, we study the inhomogeneous nonlinear Schr\"odinger equation (INLS) $$iu_t + \Delta
External link:
http://arxiv.org/abs/2410.00869
Transformers have revolutionized deep learning and generative modeling to enable unprecedented advancements in natural language processing tasks and beyond. However, designing hardware accelerators for executing transformer models is challenging due
External link:
http://arxiv.org/abs/2408.03397
Author:
Dhingra, Archit, Zaz, M. Zaid
Spin crossover (SCO) complexes are highly promising candidates for a myriad of potential applications in room-temperature electronics; however, as it stands, establishing a clear connection between their spin-state switching and transport properties
External link:
http://arxiv.org/abs/2407.17517
The mechanical complexity of flapping wings, their unsteady aerodynamic flow, and the challenge of making measurements at the scale of a sub-gram flapping-wing flying insect robot (FIR) make its behavior hard to predict. Knowing the precise mapping from
External link:
http://arxiv.org/abs/2407.00217
Flying insects can perform rapid, sophisticated maneuvers like backflips, sharp banked turns, and in-flight collision recovery. To emulate these in aerial robots weighing less than a gram, known as flying insect robots (FIRs), a fast and responsive c
External link:
http://arxiv.org/abs/2406.20061