Showing 1 - 10 of 3,932 for search: '"Hardt P"'
The breakup dynamics of viscous liquid bridges on solid surfaces is studied experimentally. It is found that the dynamics bears similarities to the breakup of free liquid bridges in the viscous regime. Nevertheless, the dynamics is significantly infl
External link:
http://arxiv.org/abs/2408.01790
Author:
Dominguez-Olmedo, Ricardo, Nanda, Vedant, Abebe, Rediet, Bechtold, Stefan, Engel, Christoph, Frankenreiter, Jens, Gummadi, Krishna, Hardt, Moritz, Livermore, Michael
Annotation and classification of legal text are central components of empirical legal research. Traditionally, these tasks are often delegated to trained research assistants. Motivated by the advances in language modeling, empirical legal scholars ar
External link:
http://arxiv.org/abs/2407.16615
Current question-answering benchmarks predominantly focus on accuracy in realizable prediction tasks. Conditioned on a question and answer-key, does the most likely token match the ground truth? Such benchmarks necessarily fail to evaluate LLMs' abil
External link:
http://arxiv.org/abs/2407.14614
We study a fundamental problem in the evaluation of large language models that we call training on the test task. Unlike wrongful practices like training on the test data, leakage, or data contamination, training on the test task is not a malpractice
External link:
http://arxiv.org/abs/2407.07890
We study the predictability of online speech on social media, and whether predictability improves with information outside a user's own posts. Recent work suggests that the predictive information contained in posts written by a user's peers can surpa
External link:
http://arxiv.org/abs/2407.12850
Algorithmic predictions are emerging as a promising solution concept for efficiently allocating societal resources. Fueling their use is an underlying assumption that such systems are necessary to identify individuals for interventions. We propose a
External link:
http://arxiv.org/abs/2406.13882
Many applications of RCTs involve the presence of multiple treatment administrators -- from field experiments to online advertising -- that compete for the subjects' attention. In the face of competition, estimating a causal effect becomes difficult,
External link:
http://arxiv.org/abs/2406.03422
The power of digital platforms is at the center of major ongoing policy and regulatory efforts. To advance existing debates, we designed and executed an experiment to measure the power of online search providers, building on the recent definition of
External link:
http://arxiv.org/abs/2405.19073
Active matter encompasses many-particle systems with self-propelling units, such as flocks of birds or schools of fish. Here, we show how self-propelling domain walls can be realised in a solid-state system when a ferrimagnet is weakly driven out of
External link:
http://arxiv.org/abs/2405.14320
Author:
Zhang, Guanhua, Hardt, Moritz
We examine multi-task benchmarks in machine learning through the lens of social choice theory. We draw an analogy between benchmarks and electoral systems, where models are candidates and tasks are voters. This suggests a distinction between cardinal
External link:
http://arxiv.org/abs/2405.01719