Výsledky vyhledávání - "Tsvetkov, P."

Report

Explore Theory of Mind: Program-guided adversarial data generation for theory of mind reasoning

Autor: Sclar, Melanie, Yu, Jane, Fazel-Zarandi, Maryam, Tsvetkov, Yulia, Bisk, Yonatan, Choi, Yejin, Celikyilmaz, Asli

Do large language models (LLMs) have theory of mind? A plethora of papers and benchmarks have been introduced to evaluate if current models have been able to develop this key ability of social intelligence. However, all rely on limited datasets with

Externí odkaz: http://arxiv.org/abs/2412.12175

Zobrazit plný text záznamu

Report

ComPO: Community Preferences for Language Model Personalization

Autor: Kumar, Sachin, Park, Chan Young, Tsvetkov, Yulia, Smith, Noah A., Hajishirzi, Hannaneh

Conventional algorithms for training language models (LMs) with human feedback rely on preferences that are assumed to account for an "average" user, disregarding subjectivity and finer-grained variations. Recent studies have raised concerns that agg

Externí odkaz: http://arxiv.org/abs/2410.16027

Zobrazit plný text záznamu

Report

Towards Realistic Evaluation of Commit Message Generation by Matching Online and Offline Settings

Autor: Tsvetkov, Petr, Eliseeva, Aleksandra, Dig, Danny, Bezzubov, Alexander, Golubev, Yaroslav, Bryksin, Timofey, Zharov, Yaroslav

Commit message generation (CMG) is a crucial task in software engineering that is challenging to evaluate correctly. When a CMG system is integrated into the IDEs and other products at JetBrains, we perform online evaluation based on user acceptance

Externí odkaz: http://arxiv.org/abs/2410.12046

Zobrazit plný text záznamu

Report

Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence

Autor: Feng, Shangbin, Wang, Zifeng, Wang, Yike, Ebrahimi, Sayna, Palangi, Hamid, Miculicich, Lesly, Kulshrestha, Achin, Rauschmayr, Nathalie, Choi, Yejin, Tsvetkov, Yulia, Lee, Chen-Yu, Pfister, Tomas

We propose Model Swarms, a collaborative search algorithm to adapt LLMs via swarm intelligence, the collective behavior guiding individual systems. Specifically, Model Swarms starts with a pool of LLM experts and a utility function. Guided by the bes

Externí odkaz: http://arxiv.org/abs/2410.11163

Zobrazit plný text záznamu

Report

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only

Autor: Yao, Jihan, Ding, Wenxuan, Feng, Shangbin, Wang, Lucy Lu, Tsvetkov, Yulia

In the absence of abundant reliable annotations for challenging tasks and contexts, how can we expand the frontier of LLM capabilities with potentially wrong answers? We focus on two research questions: (1) Can LLMs generate reliable preferences amon

Externí odkaz: http://arxiv.org/abs/2410.11055

Zobrazit plný text záznamu

Report

Biased AI can Influence Political Decision-Making

Autor: Fisher, Jillian, Feng, Shangbin, Aron, Robert, Richardson, Thomas, Choi, Yejin, Fisher, Daniel W., Pan, Jennifer, Tsvetkov, Yulia, Reinecke, Katharina

As modern AI models become integral to everyday tasks, concerns about their inherent biases and their potential impact on human decision-making have emerged. While bias in models are well-documented, less is known about how these biases influence hum

Externí odkaz: http://arxiv.org/abs/2410.06415

Zobrazit plný text záznamu

Report

Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia

Autor: Samir, Farhan, Park, Chan Young, Field, Anjalie, Shwartz, Vered, Tsvetkov, Yulia

To explain social phenomena and identify systematic biases, much research in computational social science focuses on comparative text analyses. These studies often rely on coarse corpus-level statistics or local word-level analyses, mainly in English

Externí odkaz: http://arxiv.org/abs/2410.04282

Zobrazit plný text záznamu

Report

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Autor: Chiu, Yu Ying, Jiang, Liwei, Lin, Bill Yuchen, Park, Chan Young, Li, Shuyue Stella, Ravi, Sahithya, Bhatia, Mehar, Antoniak, Maria, Tsvetkov, Yulia, Shwartz, Vered, Choi, Yejin

To make large language models (LLMs) more helpful across diverse cultures, it is essential to have effective cultural knowledge benchmarks to measure and track our progress. Effective benchmarks need to be robust, diverse, and challenging. We introdu

Externí odkaz: http://arxiv.org/abs/2410.02677

Zobrazit plný text záznamu

Report

Evidence for spin droplets (ferrons) formation in the heavy fermion metal CeB$_6$ with dynamic charge stripes

Autor: Azarevich, A. N., Khrykina, O. N., Bolotina, N. B., Gridchina, V. G., Bogach, A. V., Demishev, S. V., Krasnorussky, V. N., Gavrilkin, S. Yu., Tsvetkov, A. Yu., Shitsevalova, N. Yu., Voronov, V. V., Kugel, K. I., Rakhmanov, A. L., Gabani, S., Flachbart, K., Sluchanko, N. E.

The presented studies of resistivity (R), thermal conductivity (k) and specific heat (C) at low temperature 1.8-7 K in magnetic field up to 90 kOe made it possible to detect for the first time the exponential field dependences R(H), 1/k(H), $C(H) \si

Externí odkaz: http://arxiv.org/abs/2409.04139

Zobrazit plný text záznamu

Report

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Autor: Han, Xiaochuang, Ghazvininejad, Marjan, Koh, Pang Wei, Tsvetkov, Yulia

Recent work in image and video generation has been adopting the autoregressive LLM architecture due to its generality and potentially easy integration into multi-modal systems. The crux of applying autoregressive training in language generation to vi

Externí odkaz: http://arxiv.org/abs/2408.08459

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání