Výsledky vyhledávání

Report

Controllable Context Sensitivity and the Knob Behind It

Autor: Minder, Julian, Du, Kevin, Stoehr, Niklas, Monea, Giovanni, Wendler, Chris, West, Robert, Cotterell, Ryan

When making predictions, a language model must trade off how much it relies on its context vs. its prior knowledge. Choosing how sensitive the model is to its context is a fundamental functionality, as it enables the model to excel at tasks like retr

Externí odkaz: http://arxiv.org/abs/2411.07404

Zobrazit plný text záznamu

Report

Pyridyl-functionalized tripod molecules on Au(111): Interplay between H-bonding and metal coordination

Autor: Mohammad, Sajjan, Bisht, Neeta, Kannan, Anjana, Brandmeier, Anne, Neiss, Christian, Görling, Andreas, Stöhr, Meike, Maier, Sabine

The self-assembly of pyridyl-functionalized triazine (T4PT) was studied on Au(111) using low-temperature scanning tunneling microscopy (STM) under ultra-high vacuum conditions combined with density functional theory (DFT) calculations. In particular,

Externí odkaz: http://arxiv.org/abs/2411.01365

Zobrazit plný text záznamu

Report

Single V2 defect in 4H Silicon Carbide Schottky diode at low temperature

Autor: Steidl, Timo, Kuna, Pierre, Hesselmeier-Hüttmann, Erik, Liu, Di, Stöhr, Rainer, Knolle, Wolfgang, Ghezellou, Misagh, Ul-Hassan, Jawad, Schober, Maximilian, Bockstedte, Michel, Gali, Adam, Vorobyov, Vadim, Wrachtrup, Jörg

Nanoelectrical and photonic integration of quantum optical components is crucial for scalable solid-state quantum technologies. Silicon carbide stands out as a material with mature quantum defects and a wide variety of applications in semiconductor i

Externí odkaz: http://arxiv.org/abs/2410.09021

Zobrazit plný text záznamu

Report

Activation Scaling for Steering and Interpreting Language Models

Autor: Stoehr, Niklas, Du, Kevin, Snæbjarnarson, Vésteinn, West, Robert, Cotterell, Ryan, Schein, Aaron

Given the prompt "Rome is in", can we steer a language model to flip its prediction of an incorrect token "France" to a correct token "Italy" by only multiplying a few relevant activation vectors with scalars? We argue that successfully intervening o

Externí odkaz: http://arxiv.org/abs/2410.04962

Zobrazit plný text záznamu

Report

Importance sampling-based gradient method for dimension reduction in Poisson log-normal model

Autor: Batardière, Bastien, Chiquet, Julien, Kwon, Joon, Stoehr, Julien

High-dimensional count data poses significant challenges for statistical analysis, necessitating effective methods that also preserve explainability. We focus on a low rank constrained variant of the Poisson log-normal model, which relates the observ

Externí odkaz: http://arxiv.org/abs/2410.00476

Zobrazit plný text záznamu

Report

Pulsed magnetic field gradient on a tip for nanoscale imaging of spins

Autor: Schein-Lubomirsky, Leora, Mazor, Yarden, Stöhr, Rainer, Denisenko, Andrej, Finkler, Amit

Nanoscale magnetic resonance imaging (nanoMRI) aims at obtaining structure at the single molecule level. Most of the techniques for effecting a nanoMRI gradient use small permanent magnets. Here, we present a switchable magnetic field gradient on a t

Externí odkaz: http://arxiv.org/abs/2409.17690

Zobrazit plný text záznamu

Report

Fibrations by plane projective rational quartic curves in characteristic two

Autor: Hilario, Cesar, Stöhr, Karl-Otto

We give a complete classification, up to birational equivalence, of all fibrations by plane projective rational quartic curves in characteristic two.
Comment: 31 pages. Comments welcome at any time!

Externí odkaz: http://arxiv.org/abs/2409.05464

Zobrazit plný text záznamu

Report

Nanoscale Mapping of Magnetic Auto-oscillations with a single Spin Sensor

Autor: Hache, Toni, Anshu, Anshu, Shalomayeva, Tetyana, Stöhr, Rainer, Kern, Klaus, Wrachtrup, Jörg, Singha, Aparajita

Magnetic auto-oscillations are damping-compensated magnetization precessions. They can be generated in spin Hall nano-oscillators (SHNO) among others. Current research on these devices is dedicated to create next generation energy-efficient hardware

Externí odkaz: http://arxiv.org/abs/2406.15849

Zobrazit plný text záznamu

Report

Context versus Prior Knowledge in Language Models

Autor: Du, Kevin, Snæbjarnarson, Vésteinn, Stoehr, Niklas, White, Jennifer C., Schein, Aaron, Cotterell, Ryan

To answer a question, language models often need to integrate prior knowledge learned during pretraining and new information presented in context. We hypothesize that models perform this integration in a predictable way across different questions and

Externí odkaz: http://arxiv.org/abs/2404.04633

Zobrazit plný text záznamu

Report

Localizing Paragraph Memorization in Language Models

Autor: Stoehr, Niklas, Gordon, Mitchell, Zhang, Chiyuan, Lewis, Owen

Can we localize the weights and mechanisms used by a language model to memorize and recite entire paragraphs of its training data? In this paper, we show that while memorization is spread across multiple layers and model components, gradients of memo

Externí odkaz: http://arxiv.org/abs/2403.19851

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání