Showing 1 - 10 of 13,428 for search: '"AS Messmer"'
Learning Rate Warmup is a popular heuristic for training neural networks, especially at larger batch sizes, despite limited understanding of its benefits. Warmup decreases the update size $\Delta \mathbf{w}_t = \eta_t \mathbf{u}_t$ early in training
External link:
http://arxiv.org/abs/2410.23922
Author:
Murphy, Robert B., Messmer, Richard P.
A model for the bonding of hypervalent molecules containing sulfur is presented using generalized valence bond wavefunctions. To delineate the model from more commonly used DFT and delocalized wavefunctions, we present detailed comparisons to these o
External link:
http://arxiv.org/abs/2410.23593
On-device LLMs have gained increasing attention for their ability to enhance privacy and provide a personalized user experience. To facilitate learning with private and scarce local data, federated learning has become a standard approach, though it i
External link:
http://arxiv.org/abs/2409.13931
Author:
Tang, William, Feibush, Eliot, Dong, Ge, Borthwick, Noah, Lee, Apollo, Gomez, Juan-Felipe, Gibbs, Tom, Stone, John, Messmer, Peter, Wells, Jack, Wei, Xishuo, Lin, Zhihong
In addressing the Department of Energy's April, 2022 announcement of a Bold Decadal Vision for delivering a Fusion Pilot Plant by 2035, associated software tools need to be developed for the integration of real world engineering and supply chain data
External link:
http://arxiv.org/abs/2409.03112
In this paper, we explore the application of Unmanned Aerial Vehicles (UAVs) in maritime search and rescue (mSAR) missions, focusing on medium-sized fixed-wing drones and quadcopters. We address the challenges and limitations inherent in operating so
External link:
http://arxiv.org/abs/2403.14281
In this study, we systematically evaluate the impact of common design choices in Mixture of Experts (MoEs) on validation performance, uncovering distinct influences at token and sequence levels. We also present empirical evidence showing comparable p
External link:
http://arxiv.org/abs/2402.13089
Author:
Messmer, Martin, Zell, Andreas
Unmanned Aerial Vehicles (UAVs) are emerging as very important tools in search and rescue (SAR) missions at sea, enabling swift and efficient deployment for locating individuals or vessels in distress. The successful execution of these critical missi
External link:
http://arxiv.org/abs/2402.01494
Author:
Martina Messmer, Sandra Eckert, Amor Torre-Marin Rando, Mark Snethlage, Santos J. González-Rojí, Kaspar Hurni, Urs Beyerle, Andreas Hemp, Staline Kibet, Thomas F. Stocker
Published in:
Communications Earth & Environment, Vol 5, Iss 1, Pp 1-12 (2024)
Grassland landscapes are important ecosystems in East Africa, providing habitat and grazing grounds for wildlife and livestock and supporting pastoralism, an essential part of the agricultural sector. Since future grassland availability dire
External link:
https://doaj.org/article/0b5dddab419f471280f6b975b86439a4
Author:
Carmen A. Pfortmueller, Isabelle Ott, Martin Müller, Darius Wilson, Joerg C. Schefold, Anna S. Messmer
Published in:
Scientific Reports, Vol 14, Iss 1, Pp 1-8 (2024)
Postoperative fluid overload (FO) after cardiac surgery is common and affects recovery. Predicting FO could help optimize fluid management. This post-hoc analysis of the HERACLES randomized controlled trial evaluated the predictive value of
External link:
https://doaj.org/article/66ed269cf46549bd86a55960976a0d98
This study investigates how weight decay affects the update behavior of individual neurons in deep neural networks through a combination of applied analysis and experimentation. Weight decay can cause the expected magnitude and angular updates of a n
External link:
http://arxiv.org/abs/2305.17212