Showing 1 - 10 of 17,525 results for search: '"P, Weston"'
Author:
Weston, Robert
We consider the cyclic representations $\Omega_{rs}$ of $ U_q(\widehat{\mathfrak{sl}}_2)$ at $q^N=1$ that depend upon two points $r,s$ in the chiral Potts algebraic curve. We show how $\Omega_{rs}$ is related to the tensor product $\rho_r\otimes \bar
External link:
http://arxiv.org/abs/2412.14811
Author:
Pagnoni, Artidoro, Pasunuru, Ram, Rodriguez, Pedro, Nguyen, John, Muller, Benjamin, Li, Margaret, Zhou, Chunting, Yu, Lili, Weston, Jason, Zettlemoyer, Luke, Ghosh, Gargi, Lewis, Mike, Holtzman, Ari, Iyer, Srinivasan
We introduce the Byte Latent Transformer (BLT), a new byte-level LLM architecture that, for the first time, matches tokenization-based LLM performance at scale with significant improvements in inference efficiency and robustness. BLT encodes bytes in
External link:
http://arxiv.org/abs/2412.09871
Author:
Hao, Shibo, Sukhbaatar, Sainbayar, Su, DiJia, Li, Xian, Hu, Zhiting, Weston, Jason, Tian, Yuandong
Large language models (LLMs) are restricted to reason in the "language space", where they typically express the reasoning process with a chain-of-thought (CoT) to solve a complex reasoning problem. However, we argue that language space may not always
External link:
http://arxiv.org/abs/2412.06769
Author:
Yasunaga, Michihiro, Shamis, Leonid, Zhou, Chunting, Cohen, Andrew, Weston, Jason, Zettlemoyer, Luke, Ghazvininejad, Marjan
Recent approaches to large language model (LLM) alignment typically require millions of human annotations or rely on external aligned models for synthetic data generation. This paper introduces ALMA: Alignment with Minimal Annotation, demonstrating t
External link:
http://arxiv.org/abs/2412.04305
Author:
Tillotson, Evan, McHugh, James, Howarth, James, Hashimoto, Teruo, Clark, Nick, Weston, Astrid, Enaldiev, Vladimir, Sullivan-Allsop, Samuel, Thornley, William, Wang, Wendong, Lindley, Matthew, Pollard, Andrew, Falko, Vladimir, Gorbachev, Roman, Haigh, Sarah J.
Twisted 2D material heterostructures provide an exciting platform for investigating new fundamental physical phenomena. Many of the most interesting behaviours emerge at small twist angles, where the materials reconstruct to form areas of perfectly s
External link:
http://arxiv.org/abs/2411.16248
Author:
Dhuliawala, Shehzaad, Kulikov, Ilia, Yu, Ping, Celikyilmaz, Asli, Weston, Jason, Sukhbaatar, Sainbayar, Lanchantin, Jack
During language model decoding, it is known that using higher temperature sampling gives more creative responses, while lower temperatures are more factually accurate. However, such models are commonly applied to general instruction following, which
External link:
http://arxiv.org/abs/2411.09661
Author:
Prasad, Archiki, Yuan, Weizhe, Pang, Richard Yuanzhe, Xu, Jing, Fazel-Zarandi, Maryam, Bansal, Mohit, Sukhbaatar, Sainbayar, Weston, Jason, Yu, Jane
Self-alignment, whereby models learn to improve themselves without human annotation, is a rapidly growing research area. However, existing techniques often fail to improve complex reasoning tasks due to the difficulty of assigning correct rewards. An
External link:
http://arxiv.org/abs/2411.04109
LLMs are typically trained to answer user questions or follow instructions similarly to how human experts respond. However, in the standard alignment framework they lack the basic ability of explicit thinking before answering. Thinking is important f
External link:
http://arxiv.org/abs/2410.10630
Author:
Molina, Isabella, Chomiuk, Laura, Linford, Justin D., Aydi, Elias, Mioduszewski, Amy J., Mukai, Koji, Sokolovsky, Kirill V., Strader, Jay, Craig, Peter, Dong, Dillon, Harris, Chelsea E., Nyamai, Miriam M., Rupen, Michael P., Sokoloski, Jennifer L., Walter, Frederick M., Weston, Jennifer H. S., Williams, Montana N.
V745 Sco is a Galactic symbiotic recurrent nova with nova eruptions in 1937, 1989 and 2014. We study the behavior of V745 Sco at radio wavelengths (0.6-37 GHz), covering both its 1989 and 2014 eruptions and informed by optical, X-ray, and $\gamma$-ra
External link:
http://arxiv.org/abs/2410.01125
Author:
Chappell, Digby, Yang, Zeyu, Clark, Angus B., Berkovic, Alexandre, Laganier, Colin, Baxter, Weston, Bello, Fernando, Kormushev, Petar, Rojas, Nicolas
Myoelectric prosthetic hands are typically controlled to move between discrete positions and do not provide sensory feedback to the user. In this work, we present and evaluate a closed-loop, continuous myoelectric prosthetic hand controller that can
External link:
http://arxiv.org/abs/2409.15578