Search results

Report

Evaluating Sparse Autoencoders on Targeted Concept Erasure Tasks

Authors: Karvonen, Adam; Rager, Can; Marks, Samuel; Nanda, Neel

Sparse Autoencoders (SAEs) are an interpretability technique aimed at decomposing neural network activations into interpretable units. However, a major bottleneck for SAE development has been the lack of high-quality performance metrics, with prior w

External link: http://arxiv.org/abs/2411.18895

Report

Why quantum state verification cannot be both efficient and secure: a categorical approach

Authors: Wiesner, Fabian; Chaoui, Ziad; Kessler, Diana; Pappa, Anna; Karvonen, Martti

The advantage of quantum protocols lies in the inherent properties of the shared quantum states. These states are sometimes provided by sources that are not trusted, and therefore need to be verified. Finding secure and efficient quantum state verifi

External link: http://arxiv.org/abs/2411.04767

Report

Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models

Authors: Karvonen, Adam; Wright, Benjamin; Rager, Can; Angell, Rico; Brinkmann, Jannik; Smith, Logan; Verdun, Claudio Mayrink; Bau, David; Marks, Samuel

What latent features are encoded in language model (LM) representations? Recent work on training sparse autoencoders (SAEs) to disentangle interpretable features in LM representations has shown significant promise. However, evaluating the quality of

External link: http://arxiv.org/abs/2408.00113

Report

Möbius-Transformed Trapezoidal Rule

Authors: Suzuki, Yuya; Hyvönen, Nuutti; Karvonen, Toni

We study numerical integration by combining the trapezoidal rule with a Möbius transformation that maps the unit circle onto the real line. We prove that the resulting transformed trapezoidal rule attains the optimal rate of convergence if the inte

External link: http://arxiv.org/abs/2407.13650

Report

Maximum mean discrepancies of Farey sequences

Authors: Karvonen, Toni; Zhigljavsky, Anatoly

We identify a large class of positive-semidefinite kernels for which a certain polynomial rate of convergence of maximum mean discrepancies of Farey sequences is equivalent to the Riemann hypothesis. This class includes all Matérn kernels of order

External link: http://arxiv.org/abs/2407.10214

Report

Inner automorphisms as 2-cells

Authors: Hofstra, Pieter; Karvonen, Martti

Published in: Theory and Applications of Categories, Vol. 42, No. 2, 2024, pp. 19-40

Abstract inner automorphisms can be used to promote any category into a 2-category, and we study two-dimensional limits and colimits in the resulting 2-categories. Existing connected colimits and limits in the starting category become two-dimensional

External link: http://arxiv.org/abs/2406.13647

Report

Helsinki Speech Challenge 2024

Authors: Ludvigsen, Martin; Karvonen, Elli; Juvonen, Markus; Siltanen, Samuli

The Helsinki Speech Challenge 2024 (HSC2024) invites researchers to enhance and deconvolve speech audio recordings. We recorded a dataset that challenges participants to apply speech enhancement and inverse problems techniques to recorded speech data

External link: http://arxiv.org/abs/2406.04123

Report

Emergent World Models and Latent Variable Estimation in Chess-Playing Language Models

Author: Karvonen, Adam

Language models have shown unprecedented capabilities, sparking debate over the source of their performance. Is it merely the outcome of learning syntactic patterns and surface level statistics, or do they extract semantics and a world model from the

External link: http://arxiv.org/abs/2403.15498

Report

Construction of Optimal Algorithms for Function Approximation in Gaussian Sobolev Spaces

Authors: Suzuki, Yuya; Karvonen, Toni

This paper studies function approximation in Gaussian Sobolev spaces over the real line and measures the error in a Gaussian-weighted $L^p$-norm. We construct two linear approximation algorithms using $n$ function evaluations that achieve the optimal

External link: http://arxiv.org/abs/2402.02917

Report

Towards a Unified Theory of Time-Varying Data

Authors: Bumpus, Benjamin Merlin; Fairbanks, James; Karvonen, Martti; Leal, Wilmer; Simard, Frédéric

What is a time-varying graph, or a time-varying topological space and more generally what does it mean for a mathematical structure to vary over time? Here we introduce categories of narratives: powerful tools for studying temporal graphs and other t

External link: http://arxiv.org/abs/2402.00206
