Showing 1 - 10
of 7,030
for search: '"liu, Kang"'
Large language models encapsulate knowledge and have demonstrated superior performance on various natural language processing tasks. Recent studies have localized this knowledge to specific model parameters, such as the MLP weights in intermediate la…
External link:
http://arxiv.org/abs/2409.00617
Pretrained language models like BERT and T5 serve as crucial backbone encoders for dense retrieval. However, these models often exhibit limited generalization capabilities and face challenges in improving in-domain accuracy. Recent research has explo…
External link:
http://arxiv.org/abs/2408.12194
LLMs have achieved success in many fields but are still troubled by problematic content in their training corpora. LLM unlearning aims to reduce its influence and avoid undesirable behaviours. However, existing unlearning methods remain vulnerable to ad…
External link:
http://arxiv.org/abs/2408.10682
In the realm of event prediction, temporal knowledge graph forecasting (TKGF) stands as a pivotal technique. Previous approaches face the challenges of not utilizing experience during testing and relying on a single short-term history, which limits a…
External link:
http://arxiv.org/abs/2408.07840
Knowledge editing aims to update outdated or incorrect knowledge in large language models (LLMs). However, current knowledge editing methods have limited scalability for lifelong editing. This study explores the fundamental reason why knowledge editi…
External link:
http://arxiv.org/abs/2408.07413
Graph Convolutional Neural Network (GCN), a widely adopted method for analyzing relational data, enhances node discriminability through the aggregation of neighboring information. Usually, stacking multiple layers can improve the performance of GCN b…
External link:
http://arxiv.org/abs/2408.03152
Enabling Large Language Models (LLMs) to generate citations in Question-Answering (QA) tasks is an emerging paradigm aimed at enhancing the verifiability of their responses when LLMs utilize external references to generate an answer. However, t…
External link:
http://arxiv.org/abs/2408.04662
In this paper, we introduce semi-autonomous neural ordinary differential equations (SA-NODEs), a variation of the vanilla NODEs, employing fewer parameters. We investigate the universal approximation properties of SA-NODEs for dynamical systems from…
External link:
http://arxiv.org/abs/2407.17092
Author:
Sun, Wangtao, Zhang, Chenxiang, Zhang, Xueyou, Huang, Ziyang, Xu, Haotian, Chen, Pei, He, Shizhu, Zhao, Jun, Liu, Kang
Although Large Language Models (LLMs) have demonstrated strong instruction-following ability, they are further expected to be controlled and guided by rules in real-world scenarios to be safe, accurate, and intelligent. This demands the possession of…
External link:
http://arxiv.org/abs/2407.08440
Although Large Language Models (LLMs) excel at NLP tasks, they still need external tools to extend their abilities. Current research on tool learning with LLMs often assumes mandatory tool use, which does not always align with real-world situations, wh…
External link:
http://arxiv.org/abs/2407.12823