Zobrazeno 1 - 10
of 25 507
pro vyhledávání: '"Hooker A"'
Simultaneous space-time focusing (SSTF) is often claimed to reduce the longitudinal extent of the high-intensity region near the focus, in contradiction to the original work on this topic. Here we seek to address this confusion by using numerical and
Externí odkaz:
http://arxiv.org/abs/2410.18485
Autor:
Gureja, Srishti, Miranda, Lester James V., Islam, Shayekh Bin, Maheshwary, Rishabh, Sharma, Drishti, Winata, Gusti, Lambert, Nathan, Ruder, Sebastian, Hooker, Sara, Fadaee, Marzieh
Reward models (RMs) have driven the state-of-the-art performance of LLMs today by enabling the integration of human feedback into the language modeling process. However, RMs are primarily trained and evaluated in English, and their capabilities in mu
Externí odkaz:
http://arxiv.org/abs/2410.15522
Autor:
Chen, Chen, Bako, Hannah K., Yu, Peihong, Hooker, John, Joyal, Jeffrey, Wang, Simon C., Kim, Samuel, Wu, Jessica, Ding, Aoxue, Sandeep, Lara, Chen, Alex, Sinha, Chayanika, Liu, Zhicheng
Chart corpora, which comprise data visualizations and their semantic labels, are crucial for advancing visualization research. However, the labels in most existing chart corpora are high-level (e.g., chart types), hindering their utility for broader
Externí odkaz:
http://arxiv.org/abs/2410.12268
Autor:
Aakanksha, Ahmadian, Arash, Goldfarb-Tarrant, Seraphina, Ermis, Beyza, Fadaee, Marzieh, Hooker, Sara
Large Language Models (LLMs) have been adopted and deployed worldwide for a broad variety of applications. However, ensuring their safe use remains a significant challenge. Preference training and safety measures often overfit to harms prevalent in W
Externí odkaz:
http://arxiv.org/abs/2410.10801
Efficiency, specialization, and adaptability to new data distributions are qualities that are hard to combine in current Large Language Models. The Mixture of Experts (MoE) architecture has been the focus of significant research because its inherent
Externí odkaz:
http://arxiv.org/abs/2408.15901
The use of synthetic data has played a critical role in recent state-of-art breakthroughs. However, overly relying on a single oracle teacher model to generate data has been shown to lead to model collapse and invite propagation of biases. These limi
Externí odkaz:
http://arxiv.org/abs/2408.14960
Autor:
Aryabumi, Viraat, Su, Yixuan, Ma, Raymond, Morisot, Adrien, Zhang, Ivan, Locatelli, Acyr, Fadaee, Marzieh, Üstün, Ahmet, Hooker, Sara
Including code in the pre-training data mixture, even for models not specifically designed for code, has become a common practice in LLMs pre-training. While there has been anecdotal consensus among practitioners that code data plays a vital role in
Externí odkaz:
http://arxiv.org/abs/2408.10914
Autor:
Don-Yehiya, Shachar, Burtenshaw, Ben, Astudillo, Ramon Fernandez, Osborne, Cailean, Jaiswal, Mimansa, Kuo, Tzu-Sheng, Zhao, Wenting, Shenfeld, Idan, Peng, Andi, Yurochkin, Mikhail, Kasirzadeh, Atoosa, Huang, Yangsibo, Hashimoto, Tatsunori, Jernite, Yacine, Vila-Suero, Daniel, Abend, Omri, Ding, Jennifer, Hooker, Sara, Kirk, Hannah Rose, Choshen, Leshem
Human feedback on conversations with language language models (LLMs) is central to how these systems learn about the world, improve their capabilities, and are steered toward desirable and safe behaviors. However, this feedback is mostly collected by
Externí odkaz:
http://arxiv.org/abs/2408.16961
Various data visualization applications such as reverse engineering and interactive authoring require a vocabulary that describes the structure of visualization scenes and the procedure to manipulate them. A few scene abstractions have been proposed,
Externí odkaz:
http://arxiv.org/abs/2408.04798
Autor:
Vieira, J., Cros, B., Muggli, P., Andriyash, I. A., Apsimon, O., Backhouse, M., Benedetti, C., Bulanov, S. S., Caldwell, A., Chen, Min, Cilento, V., Corde, S., D'Arcy, R., Diederichs, S., Ericson, E., Esarey, E., Farmer, J., Fedeli, L., Formenti, A., Foster, B., Garten, M., Geddes, C. G. R., Grismayer, T., Hogan, M. J., Hooker, S., Huebl, A., Jalas, S., Kirchen, M., Lehe, R., Leemans, W., Li, Boyuan, Lindström, C. A., Losito, R., Mitchell, C. E., Mori, W. B., Piot, P., Terzani, D., Thévenet, M., Turner, M., Vay, J. -L., Völker, D., Zhang, Jie, Zhang, W.
The workshop focused on the application of ANAs to particle physics keeping in mind the ultimate goal of a collider at the energy frontier (10\,TeV, e$^+$/e$^-$, e$^-$/e$^-$, or $\gamma\gamma$). The development of ANAs is conducted at universities an
Externí odkaz:
http://arxiv.org/abs/2408.03968