Zobrazeno 1 - 10
of 369
pro vyhledávání: '"Choi, Eugene A."'
Autor:
Grinsztajn, Nathan, Flet-Berliac, Yannis, Azar, Mohammad Gheshlaghi, Strub, Florian, Wu, Bill, Choi, Eugene, Cremer, Chris, Ahmadian, Arash, Chandak, Yash, Pietquin, Olivier, Geist, Matthieu
To better align Large Language Models (LLMs) with human judgment, Reinforcement Learning from Human Feedback (RLHF) learns a reward model and then optimizes it using regularized RL. Recently, direct alignment methods were introduced to learn such a f
Externí odkaz:
http://arxiv.org/abs/2406.19188
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion
Autor:
Flet-Berliac, Yannis, Grinsztajn, Nathan, Strub, Florian, Choi, Eugene, Cremer, Chris, Ahmadian, Arash, Chandak, Yash, Azar, Mohammad Gheshlaghi, Pietquin, Olivier, Geist, Matthieu
Reinforcement Learning (RL) has been used to finetune Large Language Models (LLMs) using a reward model trained from preference data, to better align with human judgment. The recently introduced direct alignment methods, which are often simpler, more
Externí odkaz:
http://arxiv.org/abs/2406.19185
Both online and offline RLHF methods such as PPO and DPO have been extremely successful in aligning AI with human preferences. Despite their success, the existing methods suffer from a fundamental problem that their optimal solution is highly task-de
Externí odkaz:
http://arxiv.org/abs/2406.01660
Autor:
Choi, Eugene, Seong, Rak-Kyeong
Publikováno v:
Phys. Rev. D 109, 046015 (2024)
We present a collection of explicit formulas for the minimum volume of Sasaki-Einstein 5-manifolds. The cone over these 5-manifolds is a toric Calabi-Yau 3-fold. These toric Calabi-Yau 3-folds are associated with an infinite class of 4d N=1 supersymm
Externí odkaz:
http://arxiv.org/abs/2310.19276
Recent large-scale neural autoregressive sequence models have shown impressive performances on a variety of natural language generation tasks. However, their generated sequences often exhibit degenerate properties such as non-termination, undesirable
Externí odkaz:
http://arxiv.org/abs/2210.00660
Autor:
Choi, Eugene1 (AUTHOR), Shin, Jinho1 (AUTHOR), Ryu, Min-Hee2 (AUTHOR), Kim, Hyung-Don2 (AUTHOR) kimhdmd@amc.seoul.kr, Park, Young Soo1 (AUTHOR) youngspark@amc.seoul.kr
Publikováno v:
Scientific Reports. 7/31/2024, Vol. 14 Issue 1, p1-9. 9p.
Autor:
Han, Dong-Gyun, Kwak, Jinsook, Choi, Eugene, Seo, Seong-Wook, Vasileva, Elena A., Mishchenko, Natalia P., Fedoreyev, Sergey A., Stonik, Valentin A., Kim, Hyoung Kyu, Han, Jin, Byun, Jong Hyuk, Jung, Il Hyo, Yun, Hwayoung, Yoon, In-Soo
Publikováno v:
In Biomedicine & Pharmacotherapy June 2023 162
Autor:
Ripley, R. Taylor, Holmes, Hudson M., Whitlock, Richard S., Groth, Shawn S., Medina, Cristian G., Choi, Eugene A., Burt, Bryan M., Sugarbaker, Paul H.
Publikováno v:
In The Journal of Thoracic and Cardiovascular Surgery May 2023 165(5):1722-1730
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Autor:
Choi, Eugene, Han, Dong-Gyun, Park, Jeong-Eun, Lee, Ha-Yeon, Yoo, Jin-Wook, Jung, Yunjin, Song, Im-Sook, Yoon, In-Soo
Publikováno v:
In Journal of Chromatography B 1 October 2022 1208