Zobrazeno 1 - 8
of 8
pro vyhledávání: '"Kang, Jikun"'
Autor:
Kang, Jikun, Li, Xin Zhe, Chen, Xi, Kazemi, Amirreza, Sun, Qianyi, Chen, Boxing, Li, Dong, He, Xu, He, Quan, Wen, Feng, Hao, Jianye, Yao, Jun
Although Large Language Models (LLMs) achieve remarkable performance across various tasks, they often struggle with complex reasoning tasks, such as answering mathematical questions. Recent efforts to address this issue have primarily focused on leve
Externí odkaz:
http://arxiv.org/abs/2405.16265
Fine-tuning Large Language Models (LLMs) adapts a trained model to specific downstream tasks, significantly improving task-specific performance. Supervised Fine-Tuning (SFT) is a common approach, where an LLM is trained to produce desired answers. Ho
Externí odkaz:
http://arxiv.org/abs/2401.00907
Decision Transformer-based decision-making agents have shown the ability to generalize across multiple tasks. However, their performance relies on massive data and computation. We argue that this inefficiency stems from the forgetting phenomenon, in
Externí odkaz:
http://arxiv.org/abs/2305.16338
In cellular networks, User Equipment (UE) handoff from one Base Station (BS) to another, giving rise to the load balancing problem among the BSs. To address this problem, BSs can work collaboratively to deliver a smooth migration (or handoff) and sat
Externí odkaz:
http://arxiv.org/abs/2303.08003
Various automatic curriculum learning (ACL) methods have been proposed to improve the sample efficiency and final performance of deep reinforcement learning (DRL). They are designed to control how a DRL agent collects data, which is inspired by how h
Externí odkaz:
http://arxiv.org/abs/2110.03032
Large language model (LLM)-based decision-making agents have shown the ability to generalize across multiple tasks. However, their performance relies on massive data and compute. We argue that this inefficiency stems from the forgetting phenomenon, i
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::068371ebd3bdf8c377b8f1809479487f
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.