Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Huang, Benji Y. H."'
Recent research has shown that large language models (LLMs) can utilize low-precision floating point (FP) quantization to deliver high efficiency while maintaining original model accuracy. In particular, recent works have shown the effectiveness of n
Externí odkaz:
http://arxiv.org/abs/2411.18065