Zobrazeno 1 - 10
of 21 356
pro vyhledávání: '"An, Eung"'
Understanding the internal computations of large language models (LLMs) is crucial for aligning them with human values and preventing undesirable behaviors like toxic content generation. However, mechanistic interpretability is hindered by polysemant
Externí odkaz:
http://arxiv.org/abs/2412.04139
The gauge singlet right-handed neutrinos are one of the essential fields in neutrino mass models that explain tiny masses of active neutrinos. We consider the effective field theory of the Standard Model extended with these fields under the assumptio
Externí odkaz:
http://arxiv.org/abs/2411.17414
Publikováno v:
EMNLP 2024
A critical component of the current generation of language models is preference alignment, which aims to precisely control the model's behavior to meet human needs and values. The most notable among such methods is Reinforcement Learning with Human F
Externí odkaz:
http://arxiv.org/abs/2410.15096
Recent advances in large language models (LLMs) have significantly impacted the domain of multi-hop question answering (MHQA), where systems are required to aggregate information and infer answers from disparate pieces of text. However, the autoregre
Externí odkaz:
http://arxiv.org/abs/2409.19382
Autor:
Choi, Yunseon, Bae, Sangmin, Ban, Seonghyun, Jeong, Minchan, Zhang, Chuheng, Song, Lei, Zhao, Li, Bian, Jiang, Kim, Kee-Eung
With the advent of foundation models, prompt tuning has positioned itself as an important technique for directing model behaviors and eliciting desired responses. Prompt tuning regards selecting appropriate keywords included into the input, thereby a
Externí odkaz:
http://arxiv.org/abs/2407.14733
Visual Speech Recognition (VSR) stands at the intersection of computer vision and speech recognition, aiming to interpret spoken content from visual cues. A prominent challenge in VSR is the presence of homophenes-visually similar lip gestures that r
Externí odkaz:
http://arxiv.org/abs/2406.12233
We show that a pseudo-Nambu-Goldstone boson (pNGB) with an initial misalignment angle can drive successful spontaneous baryogenesis, and become a good dark matter candidate if the corresponding global symmetry is non-restored at high temperatures. Co
Externí odkaz:
http://arxiv.org/abs/2406.04180
We consider off-policy evaluation (OPE) of deterministic target policies for reinforcement learning (RL) in environments with continuous action spaces. While it is common to use importance sampling for OPE, it suffers from high variance when the beha
Externí odkaz:
http://arxiv.org/abs/2405.18792
Utilizing the realistic continuum description of twisted bilayer MoTe2 and many-body exact diagonalization calculation, we establish that the second moir\'e band of twisted bilayer MoTe2, at a small twist angle of approximately 2{\deg}, serves as an
Externí odkaz:
http://arxiv.org/abs/2403.19155
Prompt tuning, in which prompts are optimized to adapt large-scale pre-trained language models to downstream tasks instead of fine-tuning the full model parameters, has been shown to be particularly effective when the prompts are trained in a multi-t
Externí odkaz:
http://arxiv.org/abs/2402.08594