Showing 1 - 10 of 508 for search: '"Zhou, Ben"'
Retrieval Augmented Generation (RAG) improves large language models (LMs) by incorporating non-parametric knowledge through evidence retrieval from external sources. However, it often struggles to filter out inconsistent and irrelevant information th…
External link:
http://arxiv.org/abs/2409.12468
Language models have shown impressive in-context-learning capabilities, which allow them to benefit from input prompts and perform better on downstream end tasks. Existing works investigate the mechanisms behind this observation, and propose label-ag…
External link:
http://arxiv.org/abs/2406.11243
Large language models primarily rely on inductive reasoning for decision making. This results in unreliable decisions when applied to real-world tasks that often present incomplete contexts and conditions. Thus, accurate probability estimation and ap…
External link:
http://arxiv.org/abs/2404.12494
Author:
Zhou, Ben, Zhang, Hongming, Chen, Sihao, Yu, Dian, Wang, Hongwei, Peng, Baolin, Roth, Dan, Yu, Dong
Conceptual reasoning, the ability to reason in abstract and high-level perspectives, is key to generalization in human cognition. However, limited study has been done on large language models' capability to perform conceptual reasoning. In this work, …
External link:
http://arxiv.org/abs/2404.00205
While large language models (LLMs) have demonstrated increasing power, they have also given rise to a wide range of harmful behaviors. As representatives, jailbreak attacks can provoke harmful or unethical responses from LLMs, even after safety align…
External link:
http://arxiv.org/abs/2311.09827
Despite the recent advancement in large language models (LLMs) and their high performances across numerous benchmarks, recent research has unveiled that LLMs suffer from hallucinations and unfaithful reasoning. This work studies a specific type of ha…
External link:
http://arxiv.org/abs/2311.09702
Author:
Chen, Sihao, Zhang, Hongming, Chen, Tong, Zhou, Ben, Yu, Wenhao, Yu, Dian, Peng, Baolin, Wang, Hongwei, Roth, Dan, Yu, Dong
We introduce sub-sentence encoder, a contrastively-learned contextual embedding model for fine-grained semantic representation of text. In contrast to the standard practice with sentence embeddings, where the meaning of an entire sequence of text is …
External link:
http://arxiv.org/abs/2311.04335
Information retrieval (IR), or knowledge retrieval, is a critical component for many downstream tasks such as open-domain question answering (QA). It is also very challenging, as it requires succinctness, completeness, and correctness. In recent work…
External link:
http://arxiv.org/abs/2308.04756
Recent advances in multimodal large language models (LLMs) have shown extreme effectiveness in visual question answering (VQA). However, the design nature of these end-to-end models prevents them from being interpretable to humans, undermining trust…
External link:
http://arxiv.org/abs/2305.14882
Published in:
Proceedings of ACL 2023
Temporal reasoning is the task of predicting temporal relations of event pairs. While temporal reasoning models can perform reasonably well on in-domain benchmarks, we have little idea of these systems' generalizability due to existing datasets' limi…
External link:
http://arxiv.org/abs/2212.10467