Showing 1 - 3 of 3 for search: '"Rofin, Mark"'
Author:
Hahn, Michael, Rofin, Mark
Empirical studies have identified a range of learnability biases and limitations of transformers, such as a persistent difficulty in learning to compute simple formal languages such as PARITY, and a bias towards low-degree functions. However, theoret
External link:
http://arxiv.org/abs/2402.09963
The simplest way to obtain continuous interpolation between two points in high dimensional space is to draw a line between them. While previous works focused on the general connectivity between model parameters, we explored linear interpolation for p
External link:
http://arxiv.org/abs/2211.12092
Author:
Rofin, Mark, Mikhailov, Vladislav, Florinskiy, Mikhail, Kravchenko, Andrey, Tutubalina, Elena, Shavrina, Tatiana, Karabekyan, Daniel, Artemova, Ekaterina
The development of state-of-the-art systems in different applied areas of machine learning (ML) is driven by benchmarks, which have shaped the paradigm of evaluating generalisation capabilities from multiple perspectives. Although the paradigm is shi
External link:
http://arxiv.org/abs/2210.05769