Zobrazeno 1 - 10
of 7 279
pro vyhledávání: '"Stoehr A"'
Autor:
Minder, Julian, Du, Kevin, Stoehr, Niklas, Monea, Giovanni, Wendler, Chris, West, Robert, Cotterell, Ryan
When making predictions, a language model must trade off how much it relies on its context vs. its prior knowledge. Choosing how sensitive the model is to its context is a fundamental functionality, as it enables the model to excel at tasks like retr
Externí odkaz:
http://arxiv.org/abs/2411.07404
Autor:
Mohammad, Sajjan, Bisht, Neeta, Kannan, Anjana, Brandmeier, Anne, Neiss, Christian, Görling, Andreas, Stöhr, Meike, Maier, Sabine
The self-assembly of pyridyl-functionalized triazine (T4PT) was studied on Au(111) using low-temperature scanning tunneling microscopy (STM) under ultra-high vacuum conditions combined with density functional theory (DFT) calculations. In particular,
Externí odkaz:
http://arxiv.org/abs/2411.01365
Autor:
Steidl, Timo, Kuna, Pierre, Hesselmeier-Hüttmann, Erik, Liu, Di, Stöhr, Rainer, Knolle, Wolfgang, Ghezellou, Misagh, Ul-Hassan, Jawad, Schober, Maximilian, Bockstedte, Michel, Gali, Adam, Vorobyov, Vadim, Wrachtrup, Jörg
Nanoelectrical and photonic integration of quantum optical components is crucial for scalable solid-state quantum technologies. Silicon carbide stands out as a material with mature quantum defects and a wide variety of applications in semiconductor i
Externí odkaz:
http://arxiv.org/abs/2410.09021
Autor:
Stoehr, Niklas, Du, Kevin, Snæbjarnarson, Vésteinn, West, Robert, Cotterell, Ryan, Schein, Aaron
Given the prompt "Rome is in", can we steer a language model to flip its prediction of an incorrect token "France" to a correct token "Italy" by only multiplying a few relevant activation vectors with scalars? We argue that successfully intervening o
Externí odkaz:
http://arxiv.org/abs/2410.04962
High-dimensional count data poses significant challenges for statistical analysis, necessitating effective methods that also preserve explainability. We focus on a low rank constrained variant of the Poisson log-normal model, which relates the observ
Externí odkaz:
http://arxiv.org/abs/2410.00476
Nanoscale magnetic resonance imaging (nanoMRI) aims at obtaining structure at the single molecule level. Most of the techniques for effecting a nanoMRI gradient use small permanent magnets. Here, we present a switchable magnetic field gradient on a t
Externí odkaz:
http://arxiv.org/abs/2409.17690
Autor:
Hilario, Cesar, Stöhr, Karl-Otto
We give a complete classification, up to birational equivalence, of all fibrations by plane projective rational quartic curves in characteristic two.
Comment: 31 pages. Comments welcome at any time!
Comment: 31 pages. Comments welcome at any time!
Externí odkaz:
http://arxiv.org/abs/2409.05464
Autor:
Hache, Toni, Anshu, Anshu, Shalomayeva, Tetyana, Stöhr, Rainer, Kern, Klaus, Wrachtrup, Jörg, Singha, Aparajita
Magnetic auto-oscillations are damping-compensated magnetization precessions. They can be generated in spin Hall nano-oscillators (SHNO) among others. Current research on these devices is dedicated to create next generation energy-efficient hardware
Externí odkaz:
http://arxiv.org/abs/2406.15849
Autor:
Du, Kevin, Snæbjarnarson, Vésteinn, Stoehr, Niklas, White, Jennifer C., Schein, Aaron, Cotterell, Ryan
To answer a question, language models often need to integrate prior knowledge learned during pretraining and new information presented in context. We hypothesize that models perform this integration in a predictable way across different questions and
Externí odkaz:
http://arxiv.org/abs/2404.04633
Can we localize the weights and mechanisms used by a language model to memorize and recite entire paragraphs of its training data? In this paper, we show that while memorization is spread across multiple layers and model components, gradients of memo
Externí odkaz:
http://arxiv.org/abs/2403.19851