Výsledky vyhledávání - "Kang, Dongyeop"

Report

LearnerVoice: A Dataset of Non-Native English Learners' Spontaneous Speech

Autor: Kim, Haechan, Myung, Junho, Kim, Seoyoung, Lee, Sungpah, Kang, Dongyeop, Kim, Juho

Prevalent ungrammatical expressions and disfluencies in spontaneous speech from second language (L2) learners pose unique challenges to Automatic Speech Recognition (ASR) systems. However, few datasets are tailored to L2 learner speech. We publicly r

Externí odkaz: http://arxiv.org/abs/2407.04280

Zobrazit plný text záznamu

Report

Human-AI Collaborative Taxonomy Construction: A Case Study in Profession-Specific Writing Assistants

Autor: Lee, Minhwa, Kim, Zae Myung, Khetan, Vivek A., Kang, Dongyeop

Large Language Models (LLMs) have assisted humans in several writing tasks, including text revision and story generation. However, their effectiveness in supporting domain-specific writing, particularly in business contexts, is relatively less explor

Externí odkaz: http://arxiv.org/abs/2406.18675

Zobrazit plný text záznamu

Report

i-SRT: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective Judgment

Autor: Ahn, Daechul, Choi, Yura, Kim, San, Yu, Youngjae, Kang, Dongyeop, Choi, Jonghyun

Aligning Video Large Multimodal Models (VLMMs) face challenges such as modality misalignment and verbose responses. Although iterative approaches such as self-rewarding or iterative direct preference optimization (DPO) recently showed a significant i

Externí odkaz: http://arxiv.org/abs/2406.11280

Zobrazit plný text záznamu

Report

Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback

Autor: Li, Chenliang, Zeng, Siliang, Liao, Zeyi, Li, Jiaxiang, Kang, Dongyeop, Garcia, Alfredo, Hong, Mingyi

Aligning human preference and value is an important requirement for building contemporary foundation models and embodied AI. However, popular approaches such as reinforcement learning with human feedback (RLHF) break down the task into successive sta

Externí odkaz: http://arxiv.org/abs/2406.06874

Zobrazit plný text záznamu

Report

On the Sequence Evaluation based on Stochastic Processes

Autor: Zhang, Tianhao, Lin, Zhexiao, Sheng, Zhecheng, Jiang, Chen, Kang, Dongyeop

Modeling and analyzing long sequences of text is an essential task for Natural Language Processing. Success in capturing long text dynamics using neural language models will facilitate many downstream tasks such as coherence evaluation, text generati

Externí odkaz: http://arxiv.org/abs/2405.17764

Zobrazit plný text záznamu

Report

Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation

Autor: Yang, Ruixin, Rajagopal, Dheeraj, Hayati, Shirley Anugrah, Hu, Bin, Kang, Dongyeop

Uncertainty estimation is a significant issue for current large language models (LLMs) that are generally poorly calibrated and over-confident, especially with reinforcement learning from human feedback (RLHF). Unlike humans, whose decisions and conf

Externí odkaz: http://arxiv.org/abs/2404.09127

Zobrazit plný text záznamu

Report

Reinforcement Learning with Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation

Autor: de Langis, Karin, Koo, Ryan, Kang, Dongyeop

Style is an integral component of text that expresses a diverse set of information, including interpersonal dynamics (e.g. formality) and the author's emotions or attitudes (e.g. disgust). Humans often employ multiple styles simultaneously. An open q

Externí odkaz: http://arxiv.org/abs/2402.14146

Zobrazit plný text záznamu

Report

Talk Through It: End User Directed Manipulation Learning

Autor: Winge, Carl, Imdieke, Adam, Aldeeb, Bahaa, Kang, Dongyeop, Desingh, Karthik

Training generalist robot agents is an immensely difficult feat due to the requirement to perform a huge range of tasks in many different environments. We propose selectively training robots based on end-user preferences instead. Given a factory mode

Externí odkaz: http://arxiv.org/abs/2402.12509

Zobrazit plný text záznamu

Report

Shallow Synthesis of Knowledge in GPT-Generated Texts: A Case Study in Automatic Related Work Composition

Autor: Martin-Boyle, Anna, Tyagi, Aahan, Hearst, Marti A., Kang, Dongyeop

Numerous AI-assisted scholarly applications have been developed to aid different stages of the research process. We present an analysis of AI-assisted scholarly writing generated with ScholaCite, a tool we built that is designed for organizing litera

Externí odkaz: http://arxiv.org/abs/2402.12255

Zobrazit plný text záznamu

Report

Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models

Autor: Hayati, Shirley Anugrah, Jung, Taehee, Bodding-Long, Tristan, Kar, Sudipta, Sethy, Abhinav, Kim, Joo-Kyung, Kang, Dongyeop

Fine-tuning large language models (LLMs) with a collection of large and diverse instructions has improved the model's generalization to different tasks, even for unseen tasks. However, most existing instruction datasets include only single instructio

Externí odkaz: http://arxiv.org/abs/2402.11532

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání