Zobrazeno 1 - 5
of 5
pro vyhledávání: '"Shen, Weihan"'
Autor:
Yang, Kai, Tao, Jian, Lyu, Jiafei, Ge, Chunjiang, Chen, Jiaxin, Li, Qimai, Shen, Weihan, Zhu, Xiaolong, Li, Xiu
Using reinforcement learning with human feedback (RLHF) has shown significant promise in fine-tuning diffusion models. Previous methods start by training a reward model that aligns with human preferences, then leverage RL techniques to fine-tune the
Externí odkaz:
http://arxiv.org/abs/2311.13231
Autor:
Chen, Hanmo, Tao, Stone, Chen, Jiaxin, Shen, Weihan, Li, Xihui, Yu, Chenghui, Cheng, Sikai, Zhu, Xiaolong, Li, Xiu
Inspired by organisms evolving through cooperation and competition between different populations on Earth, we study the emergence of artificial collective intelligence through massive-agent reinforcement learning. To this end, We propose a new massiv
Externí odkaz:
http://arxiv.org/abs/2301.01609
Publikováno v:
In Applied Acoustics 15 January 2025 228
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Autor:
Öhlund Wistbacka G; Acoustic Technology, Department of Electrical and Photonics Engineering, Technical University of Denmark, Kongens Lyngby DK-2800, Denmark gmawi@elektro.dtu.dk, s212644@dtu.dk, jbr@elektro.dtu.dk., Shen W; Acoustic Technology, Department of Electrical and Photonics Engineering, Technical University of Denmark, Kongens Lyngby DK-2800, Denmark gmawi@elektro.dtu.dk, s212644@dtu.dk, jbr@elektro.dtu.dk., Brunskog J; Acoustic Technology, Department of Electrical and Photonics Engineering, Technical University of Denmark, Kongens Lyngby DK-2800, Denmark gmawi@elektro.dtu.dk, s212644@dtu.dk, jbr@elektro.dtu.dk.
Publikováno v:
JASA express letters [JASA Express Lett] 2022 Oct; Vol. 2 (10), pp. 105202.