Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Chahed, Ilyas"'
Autor:
Zuo, Jingwei, Velikanov, Maksim, Rhaiem, Dhia Eddine, Chahed, Ilyas, Belkada, Younes, Kunsch, Guillaume, Hacid, Hakim
In this technical report, we present Falcon Mamba 7B, a new base large language model based on the novel Mamba architecture. Falcon Mamba 7B is trained on 5.8 trillion tokens with carefully selected data mixtures. As a pure Mamba-based model, Falcon
Externí odkaz:
http://arxiv.org/abs/2410.05355