Zobrazeno 1 - 3
of 3
pro vyhledávání: '"Ambati, Sri Satish"'
Autor:
Galib, Shaikat, Wang, Shanshan, Xu, Guanshuo, Pfeiffer, Pascal, Chesler, Ryan, Landry, Mark, Ambati, Sri Satish
Smaller vision-language models (VLMs) are becoming increasingly important for privacy-focused, on-device applications due to their ability to run efficiently on consumer hardware for processing enterprise commercial documents and images. These models
Externí odkaz:
http://arxiv.org/abs/2410.13611
Autor:
Pfeiffer, Pascal, Singer, Philipp, Babakhin, Yauhen, Fodor, Gabor, Dhankhar, Nischay, Ambati, Sri Satish
We present H2O-Danube3, a series of small language models consisting of H2O-Danube3-4B, trained on 6T tokens and H2O-Danube3-500M, trained on 4T tokens. Our models are pre-trained on high quality Web data consisting of primarily English tokens in thr
Externí odkaz:
http://arxiv.org/abs/2407.09276
Autor:
Singer, Philipp, Pfeiffer, Pascal, Babakhin, Yauhen, Jeblick, Maximilian, Dhankhar, Nischay, Fodor, Gabor, Ambati, Sri Satish
We present H2O-Danube, a series of small 1.8B language models consisting of H2O-Danube-1.8B, trained on 1T tokens, and the incremental improved H2O-Danube2-1.8B trained on an additional 2T tokens. Our models exhibit highly competitive metrics across
Externí odkaz:
http://arxiv.org/abs/2401.16818