Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Wache, Magdalena"'
Autor:
Bushnaq, Lucius, Heimersheim, Stefan, Goldowsky-Dill, Nicholas, Braun, Dan, Mendel, Jake, Hänni, Kaarel, Griffin, Avery, Stöhler, Jörn, Wache, Magdalena, Hobbhahn, Marius
Mechanistic interpretability aims to understand the behavior of neural networks by reverse-engineering their internal computations. However, current methods struggle to find clear interpretations of neural network activations because a decomposition
Externí odkaz:
http://arxiv.org/abs/2405.10928