Showing 1 - 1 of 1 for search: '"Choi, Kwanseok"'
Mixture-of-Experts (MoE) large language models (LLMs) have memory requirements that often exceed GPU memory capacity, requiring costly parameter movement from secondary memories to the GPU for expert computation. In this work, we present Mixture o…
External link:
http://arxiv.org/abs/2405.18832
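
The snippet below is a minimal, hypothetical sketch (not code from the paper) of the bottleneck the abstract describes: expert parameters kept in host ("secondary") memory must be copied to the GPU before the expert computation can run, and that transfer is costly. It assumes PyTorch; the expert count and layer sizes are arbitrary placeholders.

```python
# Toy illustration of on-demand expert offloading in an MoE layer.
# Expert weights live in host (CPU) memory because they do not all fit
# on the GPU; the selected expert is copied over before it can compute.
import time
import torch

NUM_EXPERTS = 8        # placeholder values, not taken from the paper
HIDDEN = 4096
FFN = 14336

# Experts kept in "secondary" (host) memory.
cpu_experts = [
    torch.nn.Sequential(
        torch.nn.Linear(HIDDEN, FFN, bias=False),
        torch.nn.GELU(),
        torch.nn.Linear(FFN, HIDDEN, bias=False),
    )
    for _ in range(NUM_EXPERTS)
]

device = "cuda" if torch.cuda.is_available() else "cpu"
tokens = torch.randn(16, HIDDEN, device=device)

# Pretend the router selected expert 3 for this batch of tokens.
selected = 3

start = time.perf_counter()
expert = cpu_experts[selected].to(device)  # costly host-to-GPU parameter copy
if device == "cuda":
    torch.cuda.synchronize()
transfer_s = time.perf_counter() - start

with torch.no_grad():
    out = expert(tokens)                   # the actual expert computation

print(f"parameter transfer: {transfer_s * 1000:.1f} ms, output shape {tuple(out.shape)}")
```

The point of the sketch is only to make the abstract's claim concrete: for sparsely activated experts, the host-to-GPU weight copy can dominate the cost of the (comparatively small) expert computation itself.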