Zobrazeno 1 - 3
of 3
pro vyhledávání: '"Dhankhar, Nischay"'
Autor:
Pfeiffer, Pascal, Singer, Philipp, Babakhin, Yauhen, Fodor, Gabor, Dhankhar, Nischay, Ambati, Sri Satish
We present H2O-Danube3, a series of small language models consisting of H2O-Danube3-4B, trained on 6T tokens and H2O-Danube3-500M, trained on 4T tokens. Our models are pre-trained on high quality Web data consisting of primarily English tokens in thr
Externí odkaz:
http://arxiv.org/abs/2407.09276
Autor:
Singer, Philipp, Pfeiffer, Pascal, Babakhin, Yauhen, Jeblick, Maximilian, Dhankhar, Nischay, Fodor, Gabor, Ambati, Sri Satish
We present H2O-Danube, a series of small 1.8B language models consisting of H2O-Danube-1.8B, trained on 1T tokens, and the incremental improved H2O-Danube2-1.8B trained on an additional 2T tokens. Our models exhibit highly competitive metrics across
Externí odkaz:
http://arxiv.org/abs/2401.16818
Publikováno v:
In Medical Engineering and Physics June 2024 128