Showing 1 - 10 of 120
for search: '"Brauner, Jan"'
Author:
Hubinger, Evan, Denison, Carson, Mu, Jesse, Lambert, Mike, Tong, Meg, MacDiarmid, Monte, Lanham, Tamera, Ziegler, Daniel M., Maxwell, Tim, Cheng, Newton, Jermyn, Adam, Askell, Amanda, Radhakrishnan, Ansh, Anil, Cem, Duvenaud, David, Ganguli, Deep, Barez, Fazl, Clark, Jack, Ndousse, Kamal, Sachan, Kshitij, Sellitto, Michael, Sharma, Mrinank, DasSarma, Nova, Grosse, Roger, Kravec, Shauna, Bai, Yuntao, Witten, Zachary, Favaro, Marina, Brauner, Jan, Karnofsky, Holden, Christiano, Paul, Bowman, Samuel R., Graham, Logan, Kaplan, Jared, Mindermann, Sören, Greenblatt, Ryan, Shlegeris, Buck, Schiefer, Nicholas, Perez, Ethan
Humans are capable of strategically deceptive behavior: behaving helpfully in most situations, but then behaving very differently in order to pursue alternative objectives when given the opportunity. If an AI system learned such a deceptive strategy, …
External link:
http://arxiv.org/abs/2401.05566
Author:
Grace, Katja, Stewart, Harlan, Sandkühler, Julia Fabienne, Thomas, Stephen, Weinstein-Raun, Ben, Brauner, Jan
In the largest survey of its kind, 2,778 researchers who had published in top-tier artificial intelligence (AI) venues gave predictions on the pace of AI progress and the nature and impacts of advanced AI systems. The aggregate forecasts give at least …
External link:
http://arxiv.org/abs/2401.02843
Author:
Bengio, Yoshua, Hinton, Geoffrey, Yao, Andrew, Song, Dawn, Abbeel, Pieter, Darrell, Trevor, Harari, Yuval Noah, Zhang, Ya-Qin, Xue, Lan, Shalev-Shwartz, Shai, Hadfield, Gillian, Clune, Jeff, Maharaj, Tegan, Hutter, Frank, Baydin, Atılım Güneş, McIlraith, Sheila, Gao, Qiqi, Acharya, Ashwin, Krueger, David, Dragan, Anca, Torr, Philip, Russell, Stuart, Kahneman, Daniel, Brauner, Jan, Mindermann, Sören
Artificial Intelligence (AI) is progressing rapidly, and companies are shifting their focus to developing generalist AI systems that can autonomously act and pursue goals. Increases in capabilities and autonomy may soon massively amplify AI's impact, …
External link:
http://arxiv.org/abs/2310.17688
Author:
Pacchiardi, Lorenzo, Chan, Alex J., Mindermann, Sören, Moscovitz, Ilan, Pan, Alexa Y., Gal, Yarin, Evans, Owain, Brauner, Jan
Large language models (LLMs) can "lie", which we define as outputting false statements despite "knowing" the truth in a demonstrable sense. LLMs might "lie", for example, when instructed to output misinformation. Here, we develop a simple lie detector …
External link:
http://arxiv.org/abs/2309.15840
Author:
Radhakrishnan, Ansh, Nguyen, Karina, Chen, Anna, Chen, Carol, Denison, Carson, Hernandez, Danny, Durmus, Esin, Hubinger, Evan, Kernion, Jackson, Lukošiūtė, Kamilė, Cheng, Newton, Joseph, Nicholas, Schiefer, Nicholas, Rausch, Oliver, McCandlish, Sam, Showk, Sheer El, Lanham, Tamera, Maxwell, Tim, Chandrasekaran, Venkatesa, Hatfield-Dodds, Zac, Kaplan, Jared, Brauner, Jan, Bowman, Samuel R., Perez, Ethan
As large language models (LLMs) perform more difficult tasks, it becomes harder to verify the correctness and safety of their behavior. One approach to help with this issue is to prompt LLMs to externalize their reasoning, e.g., by having them generate …
External link:
http://arxiv.org/abs/2307.11768
Author:
Lanham, Tamera, Chen, Anna, Radhakrishnan, Ansh, Steiner, Benoit, Denison, Carson, Hernandez, Danny, Li, Dustin, Durmus, Esin, Hubinger, Evan, Kernion, Jackson, Lukošiūtė, Kamilė, Nguyen, Karina, Cheng, Newton, Joseph, Nicholas, Schiefer, Nicholas, Rausch, Oliver, Larson, Robin, McCandlish, Sam, Kundu, Sandipan, Kadavath, Saurav, Yang, Shannon, Henighan, Thomas, Maxwell, Timothy, Telleen-Lawton, Timothy, Hume, Tristan, Hatfield-Dodds, Zac, Kaplan, Jared, Brauner, Jan, Bowman, Samuel R., Perez, Ethan
Large language models (LLMs) perform better when they produce step-by-step, "Chain-of-Thought" (CoT) reasoning before answering a question, but it is unclear if the stated reasoning is a faithful explanation of the model's actual reasoning (i.e., its …
External link:
http://arxiv.org/abs/2307.13702
Author:
Mindermann, Sören, Brauner, Jan, Razzak, Muhammed, Sharma, Mrinank, Kirsch, Andreas, Xu, Winnie, Höltgen, Benedikt, Gomez, Aidan N., Morisot, Adrien, Farquhar, Sebastian, Gal, Yarin
Training on web-scale data can take months. But most computation and time is wasted on redundant and noisy points that are already learnt or not learnable. To accelerate training, we introduce Reducible Holdout Loss Selection (RHO-LOSS), a simple but …
External link:
http://arxiv.org/abs/2206.07137
Published in:
Nature Communications volume 13, Article number: 6793 (2022)
Benchmarks are crucial to measuring and steering progress in artificial intelligence (AI). However, recent studies have raised concerns over the state of AI benchmarking, reporting issues such as benchmark overfitting, benchmark saturation, and increasing …
External link:
http://arxiv.org/abs/2203.04592
When an image classifier outputs a wrong class label, it can be helpful to see what changes in the image would lead to a correct classification. This is the aim of algorithms generating counterfactual explanations. However, there is no easily scalable …
External link:
http://arxiv.org/abs/2111.15639
Author:
Mindermann, Sören, Razzak, Muhammed, Xu, Winnie, Kirsch, Andreas, Sharma, Mrinank, Morisot, Adrien, Gomez, Aidan N., Farquhar, Sebastian, Brauner, Jan, Gal, Yarin
Published in:
ICML 2021 Workshop on Subset Selection in Machine Learning
We introduce Goldilocks Selection, a technique for faster model training which selects a sequence of training points that are "just right". We propose an information-theoretic acquisition function -- the reducible validation loss -- and compute it with …
External link:
http://arxiv.org/abs/2107.02565