Showing 1 - 4 of 4 for search: '"Rai, Daking"'
Mechanistic interpretability (MI) is an emerging sub-field of interpretability that seeks to understand a neural network model by reverse-engineering its internal computations. Recently, MI has garnered significant attention for interpreting transformer-based language models …
External link: http://arxiv.org/abs/2407.02646
Author: Rai, Daking; Yao, Ziyu
Large language models (LLMs) have shown strong arithmetic reasoning capabilities when prompted with Chain-of-Thought (CoT) prompts. However, we have only a limited understanding of how these prompts are processed by LLMs. To demystify it, prior work has primarily …
External link: http://arxiv.org/abs/2406.12288
Compositional and domain generalization present significant challenges in semantic parsing, even for state-of-the-art semantic parsers based on pre-trained language models (LMs). In this study, we empirically investigate improving an LM's generalizability …
External link: http://arxiv.org/abs/2305.17378
While large language models (LLMs) have demonstrated strong capability in structured prediction tasks such as semantic parsing, little research has explored the underlying mechanisms of their success. Our work studies different methods for explaining …
External link: http://arxiv.org/abs/2301.13820