Showing 1 - 10 of 182 for search: '"Zeng, Zihao"'
Author:
He, Xin, Zhang, Shunkang, Wang, Yuxin, Yin, Haiyan, Zeng, Zihao, Shi, Shaohuai, Tang, Zhenheng, Chu, Xiaowen, Tsang, Ivor, Soon, Ong Yew
Sparse Mixture of Experts (MoE) models, while outperforming dense Large Language Models (LLMs) in terms of performance, face significant deployment challenges during inference due to their high memory demands. Existing offloading techniques, which in
External link:
http://arxiv.org/abs/2410.17954
Author:
Lin, Bokai, Zeng, Zihao, Xiao, Zipeng, Kou, Siqi, Hou, Tianqi, Gao, Xiaofeng, Zhang, Hao, Deng, Zhijie
KV cache has become a de facto technique for the inference of large language models (LLMs), where tensors of shape (layer number, head number, sequence length, feature dimension) are introduced to cache historical information for self-attention. As t
External link:
http://arxiv.org/abs/2410.14731
The KV-Cache technique has become the standard for the inference of large language models (LLMs). It caches states of self-attention to avoid recomputation. Yet, it is widely criticized that KV-Cache can become a bottleneck of the LLM inference syste
External link:
http://arxiv.org/abs/2410.12876
Mixture of experts (MoE) has become the standard for constructing production-level large language models (LLMs) due to its promise to boost model capacity without causing significant overheads. Nevertheless, existing MoE methods usually enforce a con
External link:
http://arxiv.org/abs/2406.13233
Published in:
In Journal of Power Sources 30 November 2024 621
Published in:
In Journal of Power Sources 30 November 2024 621
Author:
Zeng, Zihao, Mei, Bing-Ang, Song, Guangrui, Hamza, Muhammad, Yan, Zerui, Wei, Qiulong, Feng, Huihua, Zuo, Zhengxing, Jia, Boru, Xiong, Rui
Published in:
In Journal of Energy Storage 15 November 2024 102 Part A
Author:
Zeng, Zihao, Lei, Hai, Li, Jiexiang, Wang, Bing, Lei, Shuya, Ji, Xiaobo, Sun, Wei, Yang, Yue, Ge, Peng
Published in:
In Chemical Engineering Journal 1 November 2024 499
Author:
Jiang, Hu, Zou, Qiang, Li, Yong, Jiang, Yao, Cui, Junfang, Zhou, Bin, Zhou, Wentao, Chen, Siyu, Zeng, Zihao
Published in:
In Environmental Modelling and Software January 2025 183
Author:
Liu, Shuangjin (AUTHOR), Zeng, Zihao (AUTHOR), Qi, Qi (AUTHOR), Yang, Qin (AUTHOR), Hu, Yiqiu (AUTHOR)
Published in:
Psychology Research & Behavior Management. Jun2024, Vol. 17, p2477-2489. 13p.