Showing 1 - 10 of 33 for search: '"Chang, Hanwen"'
Large language models (LLMs) have demonstrated remarkable performance and tremendous potential across a wide range of tasks. However, deploying these models has been challenging due to the astronomical amount of model parameters, which requires a dem
External link:
http://arxiv.org/abs/2311.00502
Author:
Shen, Haihao, Meng, Hengyu, Dong, Bo, Wang, Zhe, Zafrir, Ofir, Ding, Yi, Luo, Yu, Chang, Hanwen, Gao, Qun, Wang, Ziheng, Boudoukh, Guy, Wasserblat, Moshe
In recent years, Transformer-based language models have become the standard approach for natural language processing tasks. However, stringent throughput and latency requirements in industrial applications are limiting their adoption. To mitigate the
External link:
http://arxiv.org/abs/2306.16601
Author:
Shen, Haihao, Zafrir, Ofir, Dong, Bo, Meng, Hengyu, Ye, Xinyu, Wang, Zhe, Ding, Yi, Chang, Hanwen, Boudoukh, Guy, Wasserblat, Moshe
Transformer-based language models have become the standard approach to solving natural language processing tasks. However, industry adoption usually requires the maximum throughput to comply with certain latency constraints that prevents Transformer
External link:
http://arxiv.org/abs/2211.07715
Published in:
Transactions of the Indian Institute of Metals; Aug2024, Vol. 77 Issue 8, p2183-2189, 7p
Published in:
Molecular Catalysis; October 2019, 477
Published in:
Journal of Physical Chemistry A; 6/6/2024, Vol. 128 Issue 22, p4425-4438, 14p
Published in:
Journal of Physical Chemistry A; 6/6/2024, Vol. 128 Issue 22, p4412-4424, 13p
Published in:
Aerospace (MDPI Publishing); May2024, Vol. 11 Issue 5, p390, 13p
Published in:
Journal of Physical Chemistry A; 4/18/2024, Vol. 128 Issue 15, p2997-3006, 10p
Author:
Wang, Zengguang, Dong, Zhenglin, Li, Yiming, Jiao, Xin, Liu, Yihao, Chang, Hanwen, Gan, Yaokai
Published in:
Biomedicines; Apr2024, Vol. 12 Issue 4, p904, 21p