Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Ruhle, Victor"'
Autor:
Ding, Dujian, Mallick, Ankur, Wang, Chi, Sim, Robert, Mukherjee, Subhabrata, Ruhle, Victor, Lakshmanan, Laks V. S., Awadallah, Ahmed Hassan
Large language models (LLMs) excel in most NLP tasks but also require expensive cloud servers for deployment due to their size, while smaller models that can be deployed on lower cost (e.g., edge) devices, tend to lag behind in terms of response qual
Externí odkaz:
http://arxiv.org/abs/2404.14618
Autor:
Xia, Menglin, Zhang, Xuchao, Couturier, Camille, Zheng, Guoqing, Rajmohan, Saravan, Ruhle, Victor
Large language models (LLMs) enhanced with retrieval augmentation has shown great performance in many applications. However, the computational demands for these models pose a challenge when applying them to real-time tasks, such as composition assist
Externí odkaz:
http://arxiv.org/abs/2308.04215