Showing 1 - 1 of 1 for search: '"Yue, Yuxuan"'
Large Language Models (LLMs) face significant deployment challenges due to their substantial memory requirements and the computational demands of the auto-regressive text generation process. This paper addresses these challenges by focusing on the quanti
External link:
http://arxiv.org/abs/2402.12065