Výsledky vyhledávání - "Hardt, Moritz"

Report

Lawma: The Power of Specialization for Legal Tasks

Autor: Dominguez-Olmedo, Ricardo, Nanda, Vedant, Abebe, Rediet, Bechtold, Stefan, Engel, Christoph, Frankenreiter, Jens, Gummadi, Krishna, Hardt, Moritz, Livermore, Michael

Annotation and classification of legal text are central components of empirical legal research. Traditionally, these tasks are often delegated to trained research assistants. Motivated by the advances in language modeling, empirical legal scholars ar

Externí odkaz: http://arxiv.org/abs/2407.16615

Zobrazit plný text záznamu

Report

Evaluating language models as risk scores

Autor: Cruz, André F., Hardt, Moritz, Mendler-Dünner, Celestine

Current question-answering benchmarks predominantly focus on accuracy in realizable prediction tasks. Conditioned on a question and answer-key, does the most likely token match the ground truth? Such benchmarks necessarily fail to evaluate language m

Externí odkaz: http://arxiv.org/abs/2407.14614

Zobrazit plný text záznamu

Report

Training on the Test Task Confounds Evaluation and Emergence

Autor: Dominguez-Olmedo, Ricardo, Dorner, Florian E., Hardt, Moritz

We study a fundamental problem in the evaluation of large language models that we call training on the test task. Unlike wrongful practices like training on the test data, leakage, or data contamination, training on the test task is not a malpractice

Externí odkaz: http://arxiv.org/abs/2407.07890

Zobrazit plný text záznamu

Report

Limits to Predicting Online Speech Using Large Language Models

Autor: Remeli, Mina, Hardt, Moritz, Williamson, Robert C.

We study the predictability of online speech on social media, and whether predictability improves with information outside a user's own posts. Recent work suggests that the predictive information contained in posts written by a user's peers can surpa

Externí odkaz: http://arxiv.org/abs/2407.12850

Zobrazit plný text záznamu

Report

Allocation Requires Prediction Only if Inequality Is Low

Autor: Shirali, Ali, Abebe, Rediet, Hardt, Moritz

Algorithmic predictions are emerging as a promising solution concept for efficiently allocating societal resources. Fueling their use is an underlying assumption that such systems are necessary to identify individuals for interventions. We propose a

Externí odkaz: http://arxiv.org/abs/2406.13882

Zobrazit plný text záznamu

Report

Causal Inference from Competing Treatments

Autor: Stoica, Ana-Andreea, Nastl, Vivian Y., Hardt, Moritz

Many applications of RCTs involve the presence of multiple treatment administrators -- from field experiments to online advertising -- that compete for the subjects' attention. In the face of competition, estimating a causal effect becomes difficult,

Externí odkaz: http://arxiv.org/abs/2406.03422

Zobrazit plný text záznamu

Report

An engine not a camera: Measuring performative power of online search

Autor: Mendler-Dünner, Celestine, Carovano, Gabriele, Hardt, Moritz

The power of digital platforms is at the center of major ongoing policy and regulatory efforts. To advance existing debates, we designed and executed an experiment to measure the power of online search providers, building on the recent definition of

Externí odkaz: http://arxiv.org/abs/2405.19073

Zobrazit plný text záznamu

Report

Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmarks

Autor: Zhang, Guanhua, Hardt, Moritz

We examine multi-task benchmarks in machine learning through the lens of social choice theory. We draw an analogy between benchmarks and electoral systems, where models are candidates and tasks are voters. This suggests a distinction between cardinal

Externí odkaz: http://arxiv.org/abs/2405.01719

Zobrazit plný text záznamu

Report

ImageNot: A contrast with ImageNet preserves model rankings

Autor: Salaudeen, Olawale, Hardt, Moritz

We introduce ImageNot, a dataset designed to match the scale of ImageNet while differing drastically in other aspects. We show that key model architectures developed for ImageNet over the years rank identically when trained and evaluated on ImageNot

Externí odkaz: http://arxiv.org/abs/2404.02112

Zobrazit plný text záznamu

Report

Predictors from causal features do not generalize better to new domains

Autor: Nastl, Vivian Y., Hardt, Moritz

We study how well machine learning models trained on causal features generalize across domains. We consider 16 prediction tasks on tabular datasets covering applications in health, employment, education, social benefits, and politics. Each dataset co

Externí odkaz: http://arxiv.org/abs/2402.09891

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání