Výsledky vyhledávání

Report

Monet: Mixture of Monosemantic Experts for Transformers

Autor: Park, Jungwoo, Ahn, Young Jin, Kim, Kee-Eung, Kang, Jaewoo

Understanding the internal computations of large language models (LLMs) is crucial for aligning them with human values and preventing undesirable behaviors like toxic content generation. However, mechanistic interpretability is hindered by polysemant

Externí odkaz: http://arxiv.org/abs/2412.04139

Zobrazit plný text záznamu

Report

Phenomenology of Dirac neutrino EFTs up to dimension six

Autor: Biswas, Anirban, Chun, Eung Jin, Mandal, Sanjoy, Nanda, Dibyendu

The gauge singlet right-handed neutrinos are one of the essential fields in neutrino mass models that explain tiny masses of active neutrinos. We consider the effective field theory of the Standard Model extended with these fields under the assumptio

Externí odkaz: http://arxiv.org/abs/2411.17414

Zobrazit plný text záznamu

Report

GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets

Autor: Kwon, Oh Joon, Matsunaga, Daiki E., Kim, Kee-Eung

Publikováno v: EMNLP 2024

A critical component of the current generation of language models is preference alignment, which aims to precisely control the model's behavior to meet human needs and values. The most notable among such methods is Reinforcement Learning with Human F

Externí odkaz: http://arxiv.org/abs/2410.15096

Zobrazit plný text záznamu

Report

Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models

Autor: Lee, Seongmin, Shin, Jaewook, Ahn, Youngjin, Seo, Seokin, Kwon, Ohjoon, Kim, Kee-Eung

Recent advances in large language models (LLMs) have significantly impacted the domain of multi-hop question answering (MHQA), where systems are required to aggregate information and infer answers from disparate pieces of text. However, the autoregre

Externí odkaz: http://arxiv.org/abs/2409.19382

Zobrazit plný text záznamu

Report

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL

Autor: Choi, Yunseon, Bae, Sangmin, Ban, Seonghyun, Jeong, Minchan, Zhang, Chuheng, Song, Lei, Zhao, Li, Bian, Jiang, Kim, Kee-Eung

With the advent of foundation models, prompt tuning has positioned itself as an important technique for directing model behaviors and eliciting desired responses. Prompt tuning regards selecting appropriate keywords included into the input, thereby a

Externí odkaz: http://arxiv.org/abs/2407.14733

Zobrazit plný text záznamu

Report

SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization

Autor: Ahn, Young Jin, Park, Jungwoo, Park, Sangha, Choi, Jonghyun, Kim, Kee-Eung

Visual Speech Recognition (VSR) stands at the intersection of computer vision and speech recognition, aiming to interpret spoken content from visual cues. A prominent challenge in VSR is the presence of homophenes-visually similar lip gestures that r

Externí odkaz: http://arxiv.org/abs/2406.12233

Zobrazit plný text záznamu

Report

Cogenesis by a sliding pNGB with symmetry non-restoration

Autor: Chun, Eung Jin, Das, Suruj Jyoti, He, Minxi, Jung, Tae Hyun, Sun, Jin

We show that a pseudo-Nambu-Goldstone boson (pNGB) with an initial misalignment angle can drive successful spontaneous baryogenesis, and become a good dark matter candidate if the corresponding global symmetry is non-restored at high temperatures. Co

Externí odkaz: http://arxiv.org/abs/2406.04180

Zobrazit plný text záznamu

Report

Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies

Autor: Lee, Haanvid, Guntara, Tri Wahyu, Lee, Jongmin, Noh, Yung-Kyun, Kim, Kee-Eung

We consider off-policy evaluation (OPE) of deterministic target policies for reinforcement learning (RL) in environments with continuous action spaces. While it is common to use importance sampling for OPE, it suffers from high variance when the beha

Externí odkaz: http://arxiv.org/abs/2405.18792

Zobrazit plný text záznamu

Report

Non-Abelian Fractional Quantum Anomalous Hall States and First Landau Level Physics in Second Moir\'e Band of Twisted Bilayer MoTe2

Autor: Ahn, Cheong-Eung, Lee, Wonjun, Yananose, Kunihiro, Kim, Youngwook, Cho, Gil Young

Utilizing the realistic continuum description of twisted bilayer MoTe2 and many-body exact diagonalization calculation, we establish that the second moir\'e band of twisted bilayer MoTe2, at a small twist angle of approximately 2{\deg}, serves as an

Externí odkaz: http://arxiv.org/abs/2403.19155

Zobrazit plný text záznamu

Report

Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning

Autor: Lee, Haeju, Jeong, Minchan, Yun, Se-Young, Kim, Kee-Eung

Prompt tuning, in which prompts are optimized to adapt large-scale pre-trained language models to downstream tasks instead of fine-tuning the full model parameters, has been shown to be particularly effective when the prompts are trained in a multi-t

Externí odkaz: http://arxiv.org/abs/2402.08594

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání