Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Ran, Yide"'
Autor:
Yao, Yuhang, Jin, Han, Shah, Alay Dilipbhai, Han, Shanshan, Hu, Zijian, Ran, Yide, Stripelis, Dimitris, Xu, Zhaozhuo, Avestimehr, Salman, He, Chaoyang
Large language models (LLMs) have surged in popularity and are extensively used in commercial applications, where the efficiency of model serving is crucial for the user experience. Most current research focuses on optimizing individual sub-procedure
Externí odkaz:
http://arxiv.org/abs/2408.00008
Autor:
Guo, Wentao, Long, Jikai, Zeng, Yimeng, Liu, Zirui, Yang, Xinyu, Ran, Yide, Gardner, Jacob R., Bastani, Osbert, De Sa, Christopher, Yu, Xiaodong, Chen, Beidi, Xu, Zhaozhuo
Zeroth-order optimization (ZO) is a memory-efficient strategy for fine-tuning Large Language Models using only forward passes. However, the application of ZO fine-tuning in memory-constrained settings such as mobile phones and laptops is still challe
Externí odkaz:
http://arxiv.org/abs/2406.02913