Zobrazeno 1 - 10
of 63 452
pro vyhledávání: '"A, Weston"'
Autor:
Prasad, Archiki, Yuan, Weizhe, Pang, Richard Yuanzhe, Xu, Jing, Fazel-Zarandi, Maryam, Bansal, Mohit, Sukhbaatar, Sainbayar, Weston, Jason, Yu, Jane
Self-alignment, whereby models learn to improve themselves without human annotation, is a rapidly growing research area. However, existing techniques often fail to improve complex reasoning tasks due to the difficulty of assigning correct rewards. An
Externí odkaz:
http://arxiv.org/abs/2411.04109
LLMs are typically trained to answer user questions or follow instructions similarly to how human experts respond. However, in the standard alignment framework they lack the basic ability of explicit thinking before answering. Thinking is important f
Externí odkaz:
http://arxiv.org/abs/2410.10630
Autor:
Molina, Isabella, Chomiuk, Laura, Linford, Justin D., Aydi, Elias, Mioduszewski, Amy J., Mukai, Koji, Sokolovsky, Kirill V., Strader, Jay, Craig, Peter, Dong, Dillon, Harris, Chelsea E., Nyamai, Miriam M., Rupen, Michael P., Sokoloski, Jennifer L., Walter, Frederick M., Weston, Jennifer H. S., Williams, Montana N.
V745 Sco is a Galactic symbiotic recurrent nova with nova eruptions in 1937, 1989 and 2014. We study the behavior of V745 Sco at radio wavelengths (0.6-37,GHz), covering both its 1989 and 2014 eruptions and informed by optical, X-ray, and $\gamma$-ra
Externí odkaz:
http://arxiv.org/abs/2410.01125
Autor:
Chappell, Digby, Yang, Zeyu, Clark, Angus B., Berkovic, Alexandre, Laganier, Colin, Baxter, Weston, Bello, Fernando, Kormushev, Petar, Rojas, Nicolas
Myoelectric prosthetic hands are typically controlled to move between discrete positions and do not provide sensory feedback to the user. In this work, we present and evaluate a closed-loop, continuous myoelectric prosthetic hand controller, that can
Externí odkaz:
http://arxiv.org/abs/2409.15578
Autor:
Zhang, Yiming, Chi, Jianfeng, Nguyen, Hailey, Upasani, Kartikeya, Bikel, Daniel M., Weston, Jason, Smith, Eric Michael
Text generation has a fundamental limitation almost by definition: there is no taking back tokens that have been generated, even when they are clearly problematic. In the context of language model safety, when a partial unsafe generation is produced,
Externí odkaz:
http://arxiv.org/abs/2409.14586
Autor:
Lupidi, Alisia, Gemmell, Carlos, Cancedda, Nicola, Dwivedi-Yu, Jane, Weston, Jason, Foerster, Jakob, Raileanu, Roberta, Lomeli, Maria
Large Language Models still struggle in challenging scenarios that leverage structured data, complex reasoning, or tool usage. In this paper, we propose Source2Synth: a new method that can be used for teaching LLMs new skills without relying on costl
Externí odkaz:
http://arxiv.org/abs/2409.08239
Autor:
Nicholl, M., Pasham, D. R., Mummery, A., Guolo, M., Gendreau, K., Dewangan, G. C., Ferrara, E. C., Remillard, R., Bonnerot, C., Chakraborty, J., Hajela, A., Dhillon, V. S., Gillan, A. F., Greenwood, J., Huber, M. E., Janiuk, A., Salvesen, G., van Velzen, S., Aamer, A., Alexander, K. D., Angus, C. R., Arzoumanian, Z., Auchettl, K., Berger, E., de Boer, T., Cendes, Y., Chambers, K. C., Chen, T. -W., Chornock, R., Fulton, M. D., Gao, H., Gillanders, J. H., Gomez, S., Gompertz, B. P., Fabian, A. C., Herman, J., Ingram, A., Kara, E., Laskar, T., Lawrence, A., Lin, C. -C., Lowe, T. B., Magnier, E. A., Margutti, R., McGee, S. L., Minguez, P., Moore, T., Nathan, E., Oates, S. R., Patra, K. C., Ramsden, P., Ravi, V., Ridley, E. J., Sheng, X., Smartt, S. J., Smith, K. W., Srivastav, S., Stein, R., Stevance, H. F., Turner, S. G. D., Wainscoat, R. J., Weston, J., Wevers, T., Young, D. R.
Quasi-periodic Eruptions (QPEs) are luminous bursts of soft X-rays from the nuclei of galaxies, repeating on timescales of hours to weeks. The mechanism behind these rare systems is uncertain, but most theories involve accretion disks around supermas
Externí odkaz:
http://arxiv.org/abs/2409.02181
Autor:
Liu, Zefang, Stacey, Weston M.
The dynamics of burning plasmas in tokamaks are crucial for advancing controlled thermonuclear fusion. This study introduces the NeuralPlasmaODE, a multi-region multi-timescale transport model to simulate the complex energy transfer processes in ITER
Externí odkaz:
http://arxiv.org/abs/2408.14404
Autor:
Nguyen, Thao, Li, Jeffrey, Oh, Sewoong, Schmidt, Ludwig, Weston, Jason, Zettlemoyer, Luke, Li, Xian
We propose a new method, instruction back-and-forth translation, to construct high-quality synthetic data grounded in world knowledge for aligning large language models (LLMs). Given documents from a web corpus, we generate and curate synthetic instr
Externí odkaz:
http://arxiv.org/abs/2408.04614
Autor:
Wang, Tianlu, Kulikov, Ilia, Golovneva, Olga, Yu, Ping, Yuan, Weizhe, Dwivedi-Yu, Jane, Pang, Richard Yuanzhe, Fazel-Zarandi, Maryam, Weston, Jason, Li, Xian
Model-based evaluation is at the heart of successful model development -- as a reward model for training, and as a replacement for human evaluation. To train such evaluators, the standard approach is to collect a large amount of human preference judg
Externí odkaz:
http://arxiv.org/abs/2408.02666