Showing 1 - 1 of 1 for search: '"Hong, Xinran"'
Authors:
Zeng, Binrui; Ji, Bin; Liu, Xiaodong; Yu, Jie; Li, Shasha; Ma, Jun; Li, Xiaopeng; Wang, Shangwen; Hong, Xinran
As large language models (LLMs) demonstrate exceptional performance across various domains, the deployment of these models on edge devices has emerged as a new trend. Quantization techniques, which reduce the size and memory footprint of LLMs, are ef
External link:
http://arxiv.org/abs/2412.18135