Showing 1 - 10
of 6,824
for search: '"Hall, David A"'
Training language models currently requires pre-determining a fixed compute budget because the typical cosine learning rate schedule depends on the total number of steps. In contrast, the Warmup-Stable-Decay (WSD) schedule uses a constant learning ra
External link:
http://arxiv.org/abs/2410.05192
Authors:
Skottfelt, Jesper, Wander, Matt, Cropper, Mark, Dryer, Ben, Hall, David J., Hayes, Richard, Kelman, Bradley, Kitching, Tom, Kohley, Ralf, Lagattuta, David, Lee-Payne, Zoe, Liebing, Patricia, Massey, Richard, McCracken, Henry Joy, Nakajima, Reiko, Nightingale, James
Due to the space radiation environment at L2, ESA's Euclid mission will be subject to a large amount of highly energetic particles over its lifetime. These particles can cause damage to the detectors by creating defects in the silicon lattice. These
External link:
http://arxiv.org/abs/2407.01268
Neural fields provide a continuous scene representation of 3D geometry and appearance in a way which has great promise for robotics applications. One functionality that unlocks unique use-cases for neural fields in robotics is object 6-DoF registrati
External link:
http://arxiv.org/abs/2404.18381
Authors:
Bolton, Elliot, Venigalla, Abhinav, Yasunaga, Michihiro, Hall, David, Xiong, Betty, Lee, Tony, Daneshjou, Roxana, Frankle, Jonathan, Liang, Percy, Carbin, Michael, Manning, Christopher D.
Models such as GPT-4 and Med-PaLM 2 have demonstrated impressive performance on a wide variety of biomedical NLP tasks. However, these models have hundreds of billions of parameters, are computationally expensive to run, require users to send their i
External link:
http://arxiv.org/abs/2403.18421
Neural fields, coordinate-based neural networks, have recently gained popularity for implicitly representing a scene. In contrast to classical methods that are based on explicit representations such as point clouds, neural fields provide a continuous
External link:
http://arxiv.org/abs/2402.09722
Published in:
Phys. Rev. Research 6, 013046 (2024)
We systematically and analytically construct a set of spinor wave functions representing defects and textures that continuously penetrate interfaces between coexisting, topologically distinct magnetic phases in a spin-2 Bose-Einstein condensate. Thes
External link:
http://arxiv.org/abs/2309.17394
Authors:
Rao, Yuhan, Redmon, Rob, Dale, Kirstine, Haupt, Sue E., Hopkinson, Aaron, Bostrom, Ann, Boukabara, Sid, Geenen, Thomas, Hall, David M., Smith, Benjamin D., Niyogi, Dev, Ramaswamy, V., Kihn, Eric A.
The accelerated change in our planet due to human activities has led to grand societal challenges including health crises, intensified extreme weather events, food security, environmental injustice, etc. Digital twin systems combined with emerging te
External link:
http://arxiv.org/abs/2306.11175
We introduce anticipation: a method for constructing a controllable generative model of a temporal point process (the event process) conditioned asynchronously on realizations of a second, correlated process (the control process). We achieve this by
External link:
http://arxiv.org/abs/2306.08620
Given the massive cost of language model pre-training, a non-trivial improvement of the optimization algorithm would lead to a material reduction on the time and cost of training. Adam and its variants have been state-of-the-art for years, and more s
External link:
http://arxiv.org/abs/2305.14342