Výsledky vyhledávání

Report

On the localisation of the high-intensity region of simultaneous space-time foci

Autor: Archer, Emily, Sun, Bangshan, Walczak, Roman, Booth, Martin, Hooker, Simon

Simultaneous space-time focusing (SSTF) is often claimed to reduce the longitudinal extent of the high-intensity region near the focus, in contradiction to the original work on this topic. Here we seek to address this confusion by using numerical and

Externí odkaz: http://arxiv.org/abs/2410.18485

Zobrazit plný text záznamu

Report

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Autor: Gureja, Srishti, Miranda, Lester James V., Islam, Shayekh Bin, Maheshwary, Rishabh, Sharma, Drishti, Winata, Gusti, Lambert, Nathan, Ruder, Sebastian, Hooker, Sara, Fadaee, Marzieh

Reward models (RMs) have driven the state-of-the-art performance of LLMs today by enabling the integration of human feedback into the language modeling process. However, RMs are primarily trained and evaluated in English, and their capabilities in mu

Externí odkaz: http://arxiv.org/abs/2410.15522

Zobrazit plný text záznamu

Report

VisAnatomy: An SVG Chart Corpus with Fine-Grained Semantic Labels

Autor: Chen, Chen, Bako, Hannah K., Yu, Peihong, Hooker, John, Joyal, Jeffrey, Wang, Simon C., Kim, Samuel, Wu, Jessica, Ding, Aoxue, Sandeep, Lara, Chen, Alex, Sinha, Chayanika, Liu, Zhicheng

Chart corpora, which comprise data visualizations and their semantic labels, are crucial for advancing visualization research. However, the labels in most existing chart corpora are high-level (e.g., chart types), hindering their utility for broader

Externí odkaz: http://arxiv.org/abs/2410.12268

Zobrazit plný text záznamu

Report

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Autor: Aakanksha, Ahmadian, Arash, Goldfarb-Tarrant, Seraphina, Ermis, Beyza, Fadaee, Marzieh, Hooker, Sara

Large Language Models (LLMs) have been adopted and deployed worldwide for a broad variety of applications. However, ensuring their safe use remains a significant challenge. Preference training and safety measures often overfit to harms prevalent in W

Externí odkaz: http://arxiv.org/abs/2410.10801

Zobrazit plný text záznamu

Report

Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts

Autor: Gritsch, Nikolas, Zhang, Qizhen, Locatelli, Acyr, Hooker, Sara, Üstün, Ahmet

Efficiency, specialization, and adaptability to new data distributions are qualities that are hard to combine in current Large Language Models. The Mixture of Experts (MoE) architecture has been the focus of significant research because its inherent

Externí odkaz: http://arxiv.org/abs/2408.15901

Zobrazit plný text záznamu

Report

Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress

Autor: Odumakinde, Ayomide, D'souza, Daniel, Verga, Pat, Ermis, Beyza, Hooker, Sara

The use of synthetic data has played a critical role in recent state-of-art breakthroughs. However, overly relying on a single oracle teacher model to generate data has been shown to lead to model collapse and invite propagation of biases. These limi

Externí odkaz: http://arxiv.org/abs/2408.14960

Zobrazit plný text záznamu

Report

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Autor: Aryabumi, Viraat, Su, Yixuan, Ma, Raymond, Morisot, Adrien, Zhang, Ivan, Locatelli, Acyr, Fadaee, Marzieh, Üstün, Ahmet, Hooker, Sara

Including code in the pre-training data mixture, even for models not specifically designed for code, has become a common practice in LLMs pre-training. While there has been anecdotal consensus among practitioners that code data plays a vital role in

Externí odkaz: http://arxiv.org/abs/2408.10914

Zobrazit plný text záznamu

Report

The Future of Open Human Feedback

Human feedback on conversations with language language models (LLMs) is central to how these systems learn about the world, improve their capabilities, and are steered toward desirable and safe behaviors. However, this feedback is mostly collected by

Externí odkaz: http://arxiv.org/abs/2408.16961

Zobrazit plný text záznamu

Report

Manipulable Semantic Components: a Computational Representation of Data Visualization Scenes

Autor: Liu, Zhicheng, Chen, Chen, Hooker, John

Various data visualization applications such as reverse engineering and interactive authoring require a vocabulary that describes the structure of visualization scenes and the procedure to manipulate them. A few scene abstractions have been proposed,

Externí odkaz: http://arxiv.org/abs/2408.04798

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání