Showing 1 - 10 of 39 for search: '"Liu, Alisa"'
Despite their wide adoption, the biases and unintended behaviors of language models remain poorly understood. In this paper, we identify and characterize a phenomenon never discussed before, which we call semantic leakage, where models leak irrelevant…
External link:
http://arxiv.org/abs/2408.06518
The pretraining data of today's strongest language models is opaque; in particular, little is known about the proportions of various domains or languages represented. In this work, we tackle a task which we call data mixture inference, which aims to…
External link:
http://arxiv.org/abs/2407.16607
Author:
Shi, Ruizhe, Chen, Yifang, Hu, Yushi, Liu, Alisa, Hajishirzi, Hannaneh, Smith, Noah A., Du, Simon
Aligning language models (LMs) to human preferences has emerged as a critical pursuit, enabling these models to better serve diverse user needs. Existing methods primarily focus on optimizing LMs for a single reward function, limiting their adaptability…
External link:
http://arxiv.org/abs/2406.18853
Ambiguity is a critical component of language that allows for more effective communication between speakers, but is often ignored in NLP. Recent work suggests that NLP systems may struggle to grasp certain elements of human language understanding because…
External link:
http://arxiv.org/abs/2403.14072
Despite the general capabilities of large pretrained language models, they consistently benefit from further adaptation to better achieve desired behaviors. However, tuning these models has become increasingly resource-intensive, or impossible when model…
External link:
http://arxiv.org/abs/2401.08565
The translation of ambiguous text presents a challenge for translation systems, as it requires using the surrounding context to disambiguate the intended meaning as much as possible. While prior work has studied ambiguities that result from different…
External link:
http://arxiv.org/abs/2310.14610
Author:
McKenzie, Ian R., Lyzhov, Alexander, Pieler, Michael, Parrish, Alicia, Mueller, Aaron, Prabhu, Ameya, McLean, Euan, Kirtland, Aaron, Ross, Alexis, Liu, Alisa, Gritsevskiy, Andrew, Wurgaft, Daniel, Kauffman, Derik, Recchia, Gabriel, Liu, Jiacheng, Cavanagh, Joe, Weiss, Max, Huang, Sicong, Droid, The Floating, Tseng, Tom, Korbak, Tomasz, Shen, Xudong, Zhang, Yuhui, Zhou, Zhengping, Kim, Najoung, Bowman, Samuel R., Perez, Ethan
Published in:
Transactions on Machine Learning Research (TMLR), 10/2023, https://openreview.net/forum?id=DwgRm72GQF
Work on scaling laws has found that large language models (LMs) show predictable improvements to overall loss with increased scale (model size, training data, and compute). Here, we present evidence for the claim that LMs may show inverse scaling, or…
External link:
http://arxiv.org/abs/2306.09479
A major risk of using language models in practical applications is their tendency to hallucinate incorrect statements. Hallucinations are often attributed to knowledge gaps in LMs, but we hypothesize that in some cases, when justifying previously generated…
External link:
http://arxiv.org/abs/2305.13534
Author:
Liu, Alisa, Wu, Zhaofeng, Michael, Julian, Suhr, Alane, West, Peter, Koller, Alexander, Swayamdipta, Swabha, Smith, Noah A., Choi, Yejin
Ambiguity is an intrinsic feature of natural language. Managing ambiguity is a key part of human language understanding, allowing us to anticipate misunderstanding as communicators and revise our interpretations as listeners. As language models (LMs)…
External link:
http://arxiv.org/abs/2304.14399
Author:
Wang, Yizhong, Kordi, Yeganeh, Mishra, Swaroop, Liu, Alisa, Smith, Noah A., Khashabi, Daniel, Hajishirzi, Hannaneh
Large "instruction-tuned" language models (i.e., finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is often limi
External link:
http://arxiv.org/abs/2212.10560