Výsledky vyhledávání

Report

Emergent properties with repeated examples

We study the performance of transformers as a function of the number of repetitions of training examples with algorithmically generated datasets. On three problems of mathematics: the greatest common divisor, modular multiplication, and matrix eigenv

Externí odkaz: http://arxiv.org/abs/2410.07041

Zobrazit plný text záznamu

Report

Strong Model Collapse

Autor: Dohmatob, Elvis, Feng, Yunzhen, Subramonian, Arjun, Kempe, Julia

Within the scaling laws paradigm, which underpins the training of large neural networks like ChatGPT and Llama, we consider a supervised regression setting and establish the existance of a strong form of the model collapse phenomenon, a critical perf

Externí odkaz: http://arxiv.org/abs/2410.04840

Zobrazit plný text záznamu

Report

Mission Impossible: A Statistical Perspective on Jailbreaking LLMs

Autor: Su, Jingtong, Kempe, Julia, Ullrich, Karen

Large language models (LLMs) are trained on a deluge of text data with limited quality control. As a result, LLMs can exhibit unintended or even harmful behaviours, such as leaking information, fake news or hate speech. Countermeasures, commonly refe

Externí odkaz: http://arxiv.org/abs/2408.01420

Zobrazit plný text záznamu

Report

Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement

Autor: Feng, Yunzhen, Dohmatob, Elvis, Yang, Pu, Charton, Francois, Kempe, Julia

Synthesized data from generative models is increasingly considered as an alternative to human-annotated data for fine-tuning Large Language Models. This raises concerns about model collapse: a drop in performance of models fine-tuned on generated dat

Externí odkaz: http://arxiv.org/abs/2406.07515

Zobrazit plný text záznamu

Report

The Price of Implicit Bias in Adversarially Robust Generalization

Autor: Tsilivis, Nikolaos, Frank, Natalie, Srebro, Nathan, Kempe, Julia

We study the implicit bias of optimization in robust empirical risk minimization (robust ERM) and its connection with robust generalization. In classification settings under adversarial perturbations with linear models, we study what type of regulari

Externí odkaz: http://arxiv.org/abs/2406.04981

Zobrazit plný text záznamu

Report

Iteration Head: A Mechanistic Study of Chain-of-Thought

Autor: Cabannes, Vivien, Arnal, Charles, Bouaziz, Wassim, Yang, Alice, Charton, Francois, Kempe, Julia

Chain-of-Thought (CoT) reasoning is known to improve Large Language Models both empirically and in terms of theoretical approximation power. However, our understanding of the inner workings and conditions of apparition of CoT capabilities remains lim

Externí odkaz: http://arxiv.org/abs/2406.02128

Zobrazit plný text záznamu

Report

Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks

Autor: Feng, Yunzhen, Rudner, Tim G. J., Tsilivis, Nikolaos, Kempe, Julia

Adversarial examples have been shown to cause neural networks to fail on a wide range of vision and language tasks, but recent work has claimed that Bayesian neural networks (BNNs) are inherently robust to adversarial perturbations. In this work, we

Externí odkaz: http://arxiv.org/abs/2404.19640

Zobrazit plný text záznamu

Report

Robust Data Pruning: Uncovering and Overcoming Implicit Bias

Autor: Vysogorets, Artem, Ahuja, Kartik, Kempe, Julia

In the era of exceptionally data-hungry models, careful selection of the training data is essential to mitigate the extensive costs of deep learning. Data pruning offers a solution by removing redundant or uninformative samples from the dataset, whic

Externí odkaz: http://arxiv.org/abs/2404.05579

Zobrazit plný text záznamu

Report

Mind the GAP: Improving Robustness to Subpopulation Shifts with Group-Aware Priors

Autor: Rudner, Tim G. J., Zhang, Ya Shi, Wilson, Andrew Gordon, Kempe, Julia

Machine learning models often perform poorly under subpopulation shifts in the data distribution. Developing methods that allow machine learning models to better generalize to such shifts is crucial for safe deployment in real-world settings. In this

Externí odkaz: http://arxiv.org/abs/2403.09869

Zobrazit plný text záznamu

Report

Stability and Multigroup Fairness in Ranking with Uncertain Predictions

Autor: Devic, Siddartha, Korolova, Aleksandra, Kempe, David, Sharan, Vatsal

Rankings are ubiquitous across many applications, from search engines to hiring committees. In practice, many rankings are derived from the output of predictors. However, when predictors trained for classification tasks have intrinsic uncertainty, it

Externí odkaz: http://arxiv.org/abs/2402.09326

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání