Výsledky vyhledávání

Report

Autor: Kramar, Mirna, Hahn, Lauritz, Walczak, Aleksandra M, Mora, Thierry, Coppey, Mathieu

Cells use signalling pathways as windows into the environment to gather information, transduce it into their interior, and use it to drive behaviours. MAPK (ERK) is a highly conserved signalling pathway in eukaryotes, directing multiple fundamental c

Externí odkaz: http://arxiv.org/abs/2410.22571

Zobrazit plný text záznamu

Report

Dynamic cost allocation allows network-forming forager to switch between search strategies

Autor: Schick, Lisa, Kramar, Mirna, Alim, Karen

Publikováno v: PRX Life (2024) 2(3), 033005

Network-forming organisms, like fungi and slime molds, dynamically reorganize their networks during foraging. The resulting re-routing of resource flows within the organism's network can significantly impact local ecosystems. In current analysis limi

Externí odkaz: http://arxiv.org/abs/2408.17134

Zobrazit plný text záznamu

Report

Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Autor: Lieberum, Tom, Rajamanoharan, Senthooran, Conmy, Arthur, Smith, Lewis, Sonnerat, Nicolas, Varma, Vikrant, Kramár, János, Dragan, Anca, Shah, Rohin, Nanda, Neel

Sparse autoencoders (SAEs) are an unsupervised method for learning a sparse decomposition of a neural network's latent representations into seemingly interpretable features. Despite recent excitement about their potential, research applications outsi

Externí odkaz: http://arxiv.org/abs/2408.05147

Zobrazit plný text záznamu

Report

Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders

Autor: Rajamanoharan, Senthooran, Lieberum, Tom, Sonnerat, Nicolas, Conmy, Arthur, Varma, Vikrant, Kramár, János, Nanda, Neel

Sparse autoencoders (SAEs) are a promising unsupervised approach for identifying causally relevant and interpretable linear features in a language model's (LM) activations. To be useful for downstream tasks, SAEs need to decompose LM activations fait

Externí odkaz: http://arxiv.org/abs/2407.14435

Zobrazit plný text záznamu

Report

On scalable oversight with weak LLMs judging strong LLMs

Autor: Kenton, Zachary, Siegel, Noah Y., Kramár, János, Brown-Cohen, Jonah, Albanie, Samuel, Bulian, Jannis, Agarwal, Rishabh, Lindner, David, Tang, Yunhao, Goodman, Noah D., Shah, Rohin

Scalable oversight protocols aim to enable humans to accurately supervise superhuman AI. In this paper we study debate, where two AI's compete to convince a judge; consultancy, where a single AI tries to convince a judge that asks questions; and comp

Externí odkaz: http://arxiv.org/abs/2407.04622

Zobrazit plný text záznamu

Report

Dirac operators on the half-line: stability of spectrum and non-relativistic limit

Autor: Kramar, David, Krejcirik, David

We consider Dirac operators on the half-line, subject to generalised infinite-mass boundary conditions. We derive sufficient conditions which guarantee the stability of the spectrum against possibly non-self-adjoint potential perturbations and study

Externí odkaz: http://arxiv.org/abs/2405.10009

Zobrazit plný text záznamu

Report

Improving Dictionary Learning with Gated Sparse Autoencoders

Autor: Rajamanoharan, Senthooran, Conmy, Arthur, Smith, Lewis, Lieberum, Tom, Varma, Vikrant, Kramár, János, Shah, Rohin, Nanda, Neel

Recent work has found that sparse autoencoders (SAEs) are an effective technique for unsupervised discovery of interpretable features in language models' (LMs) activations, by finding sparse, linear reconstructions of LM activations. We introduce the

Externí odkaz: http://arxiv.org/abs/2404.16014

Zobrazit plný text záznamu

Report

AtP*: An efficient and scalable method for localizing LLM behaviour to components

Autor: Kramár, János, Lieberum, Tom, Shah, Rohin, Nanda, Neel

Activation Patching is a method of directly computing causal attributions of behavior to model components. However, applying it exhaustively requires a sweep with cost scaling linearly in the number of model components, which can be prohibitively exp

Externí odkaz: http://arxiv.org/abs/2403.00745

Zobrazit plný text záznamu

Akademický článek

MENINGOCOCCAL MENINGITIS WITH ARNOLD-CHIARI MALFORMATION: CASE REPORT

Autor: Kramar Lyubov Vasilievna, Larina Tatyana Yurievna, Khlynina Yuliya Olegovna

Publikováno v: Паёми Сино, Vol 26, Iss 4, Pp 685-693 (2024)

Arnold-Chiari malformation (ACM) is a developmental anomaly of the brain characterized by the descent of the cerebellar tonsils into the foramen magnum, leading to compression of the medulla oblongata and subsequent neurological symptoms. ACM can man

Externí odkaz: https://doaj.org/article/61519c2817b046f4a581873d2c33219c

Zobrazit plný text záznamu

Report

Explaining grokking through circuit efficiency

Autor: Varma, Vikrant, Shah, Rohin, Kenton, Zachary, Kramár, János, Kumar, Ramana

One of the most surprising puzzles in neural network generalisation is grokking: a network with perfect training accuracy but poor generalisation will, upon further training, transition to perfect generalisation. We propose that grokking occurs when

Externí odkaz: http://arxiv.org/abs/2309.02390

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání