Showing 1 - 10 of 89 for search: '"Kang, Dongjin"'
Authors:
Kim, Sunghwan, Kang, Dongjin, Kwon, Taeyoon, Chae, Hyungjoo, Won, Jungsoo, Lee, Dongha, Yeo, Jinyoung
Reward models are key in reinforcement learning from human feedback (RLHF) systems, aligning the model behavior with human preferences. Particularly in the math domain, there have been plenty of studies using reward models to align policies for impro
External link:
http://arxiv.org/abs/2410.01729
Authors:
Chae, Hyungjoo, Kwon, Taeyoon, Moon, Seungjun, Song, Yongho, Kang, Dongjin, Ong, Kai Tzu-iunn, Kwak, Beong-woo, Bae, Seonghyeon, Hwang, Seung-won, Yeo, Jinyoung
This paper presents Coffee-Gym, a comprehensive RL environment for training models that provide feedback on code editing. Coffee-Gym includes two major components: (1) Coffee, a dataset containing humans' code edit traces for coding questions and mac
External link:
http://arxiv.org/abs/2409.19715
Authors:
Lee, Suyeon, Kim, Sunghwan, Kim, Minju, Kang, Dongjin, Yang, Dongil, Kim, Harim, Kang, Minseok, Jung, Dayi, Kim, Min Hee, Lee, Seungbeen, Chung, Kyoung-Mee, Yu, Youngjae, Lee, Dongha, Yeo, Jinyoung
Recently, the demand for psychological counseling has significantly increased as more individuals express concerns about their mental health. This surge has accelerated efforts to improve the accessibility of counseling by using large language models
External link:
http://arxiv.org/abs/2407.03103
Authors:
Kang, Dongjin, Kim, Sunghwan, Kwon, Taeyoon, Moon, Seungjun, Cho, Hyunsouk, Yu, Youngjae, Lee, Dongha, Yeo, Jinyoung
Emotional Support Conversation (ESC) is a task aimed at alleviating individuals' emotional distress through daily conversation. Given its inherent complexity and non-intuitive nature, ESConv dataset incorporates support strategies to facilitate the g
External link:
http://arxiv.org/abs/2402.13211
Authors:
Kwon, Taeyoon, Ong, Kai Tzu-iunn, Kang, Dongjin, Moon, Seungjun, Lee, Jeong Ryong, Hwang, Dosik, Sim, Yongsik, Sohn, Beomseok, Lee, Dongha, Yeo, Jinyoung
Machine reasoning has made great progress in recent years owing to large language models (LLMs). In the clinical domain, however, most NLP-driven projects mainly focus on clinical classification or reading comprehension, and under-explore clinical re
External link:
http://arxiv.org/abs/2312.07399
Authors:
Moon, Seungjun, Chae, Hyungjoo, Song, Yongho, Kwon, Taeyoon, Kang, Dongjin, Ong, Kai Tzu-iunn, Hwang, Seung-won, Yeo, Jinyoung
Code editing is an essential step towards reliable program synthesis to automatically correct critical errors generated from code LLMs. Recent studies have demonstrated that closed-source LLMs (i.e., ChatGPT and GPT-4) are capable of generating corre
External link:
http://arxiv.org/abs/2311.07215
Authors:
Bhattarai, Rajan Sharma, Das, Amitesh, Alzhrani, Rami M., Kang, Dongjin, Bhaduri, Sarit B., Boddu, Sai H.S.
Published in:
Materials Science & Engineering C, 1 August 2017, 77:895-903
Authors:
Yun, Heejeong, Kang, Dongjin, Kang, Youngeun (jiyoon8936@gmail.com)
Published in:
Environment, Development & Sustainability, Jan 2022, Vol. 24, Issue 1, pp. 502-526 (25 pp.)