Showing 1 - 10 of 29,297 for search: '"Inui, A"'
Author:
Hiraoka, Tatsuya, Inui, Kentaro
This paper introduces repetition neurons, regarded as skill neurons responsible for the repetition problem in text generation tasks. These neurons are progressively activated more strongly as repetition continues, indicating that they perceive repeti
External link:
http://arxiv.org/abs/2410.13497
Author:
El-Shangiti, Ahmed Oumar, Hiraoka, Tatsuya, AlQuabeh, Hilal, Heinzerling, Benjamin, Inui, Kentaro
This paper investigates whether large language models (LLMs) utilize numerical attributes encoded in a low-dimensional subspace of the embedding space when answering logical comparison questions (e.g., Was Cristiano born before Messi?). We first iden
External link:
http://arxiv.org/abs/2410.13194
The curvature perturbation in a model of constant-roll (CR) inflation is interpreted in view of the logarithmic duality discovered in Ref. [1] according to the $\delta N$ formalism. We confirm that the critical value $\beta:=\ddot{\varphi}/(H\dot{\va
External link:
http://arxiv.org/abs/2409.13500
Entity tracking is essential for complex reasoning. To perform in-context entity tracking, language models (LMs) must bind an entity to its attribute (e.g., bind a container to its content) to recall the attribute for a given entity. For example, given a
External link:
http://arxiv.org/abs/2409.05448
Magneto-conductance in thin wires often exhibits complicated patterns due to the quantum interference of conduction electrons. These patterns reflect microscopic structures in the wires, such as defects or potential distributions. In this study, we p
External link:
http://arxiv.org/abs/2409.02009
The complexities of chats pose significant challenges for machine translation models. Recognizing the need for a precise evaluation metric to address the issues of chat translation, this study introduces Multidimensional Quality Metrics for Chat Tran
External link:
http://arxiv.org/abs/2408.16390
Published in:
IJCNLP-AACL 2023 Student Research Workshop
The complexities of chats pose significant challenges for machine translation models. Recognizing the need for a precise evaluation metric to address the issues of chat translation, this study introduces Multidimensional Quality Metrics for Chat Tran
External link:
http://arxiv.org/abs/2408.15543
Published in:
AIED 2023. Lecture Notes in Computer Science, vol. 13916, pp. 78-89. Springer
Automated Short Answer Scoring (SAS) is the task of automatically scoring a given input to a prompt based on rubrics and reference answers. Although SAS is useful in real-world applications, both rubrics and reference answers differ between prompts,
External link:
http://arxiv.org/abs/2408.13966
Gravitational waves (GWs) from gravitational three-body decay (graviton Bremsstrahlung process) can leave an indelible signal at ultrahigh frequencies. We focus on a scenario where superheavy particles are produced gravitationally at a transition bet
External link:
http://arxiv.org/abs/2408.10786
Author:
Aoki, Yoichi, Kudo, Keito, Kuribayashi, Tatsuki, Sone, Shusaku, Taniguchi, Masaya, Sakaguchi, Keisuke, Inui, Kentaro
Multi-step reasoning instruction, such as chain-of-thought prompting, is widely adopted to explore better language model (LM) performance. We report on the systematic strategy that LMs employ in such a multi-step reasoning process. Our controlled e
External link:
http://arxiv.org/abs/2406.16078