Showing 1 - 3 of 3 for search: '"Lu, Aojun"'
Efforts to overcome catastrophic forgetting have primarily centered around developing more effective Continual Learning (CL) methods. In contrast, less attention was devoted to analyzing the role of network architecture design (e.g., network depth, w
External link:
http://arxiv.org/abs/2404.14829
Author:
Bian, Ang, Li, Wei, Yuan, Hangjie, Yu, Chengrong, Wang, Mang, Zhao, Zixiang, Lu, Aojun, Ji, Pengliang, Feng, Tao
Model generalization ability upon incrementally acquiring dynamically updating knowledge from sequentially arriving tasks is crucial to tackle the sensitivity-stability dilemma in Continual Learning (CL). Weight loss landscape sharpness minimization
External link:
http://arxiv.org/abs/2404.00986
As Pre-trained Language Models (PLMs), a popular approach for code intelligence, continue to grow in size, the computational cost of their usage has become prohibitively expensive. Prompt learning, a recent development in the field of natural languag
External link:
http://arxiv.org/abs/2403.13588