Výsledky vyhledávání - "Mannelli, Stefano Sarao"

Report

Optimal Protocols for Continual Learning via Statistical Physics and Control Theory

Autor: Mori, Francesco, Mannelli, Stefano Sarao, Mignacco, Francesca

Artificial neural networks often struggle with catastrophic forgetting when learning multiple tasks sequentially, as training on new tasks degrades the performance on previously learned ones. Recent theoretical work has addressed this issue by analys

Externí odkaz: http://arxiv.org/abs/2409.18061

Zobrazit plný text záznamu

Report

Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks

Autor: Mannelli, Stefano Sarao, Ivashynka, Yaraslau, Saxe, Andrew, Saglietti, Luca

A wide range of empirical and theoretical works have shown that overparameterisation can amplify the performance of neural networks. According to the lottery ticket hypothesis, overparameterised networks have an increased chance of containing a sub-n

Externí odkaz: http://arxiv.org/abs/2406.01589

Zobrazit plný text záznamu

Report

Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Training

Autor: Jain, Anchit, Nobahari, Rozhin, Baratin, Aristide, Mannelli, Stefano Sarao

Machine learning systems often acquire biases by leveraging undesired features in the data, impacting accuracy variably across different sub-populations. Current understanding of bias formation mostly focuses on the initial and final stages of learni

Externí odkaz: http://arxiv.org/abs/2405.18296

Zobrazit plný text záznamu

Report

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning

Autor: Lee, Jin Hwa, Mannelli, Stefano Sarao, Saxe, Andrew

Diverse studies in systems neuroscience begin with extended periods of curriculum training known as `shaping' procedures. These involve progressively studying component parts of more complex tasks, and can make the difference between learning a task

Externí odkaz: http://arxiv.org/abs/2402.18361

Zobrazit plný text záznamu

Report

The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions

Autor: Patel, Nishil, Lee, Sebastian, Mannelli, Stefano Sarao, Goldt, Sebastian, Saxe, Andrew

Reinforcement learning (RL) algorithms have proven transformative in a range of domains. To tackle real-world domains, these systems often use neural networks to learn policies directly from pixels or other high-dimensional sensory input. By contrast

Externí odkaz: http://arxiv.org/abs/2306.10404

Zobrazit plný text záznamu

Report

Optimal transfer protocol by incremental layer defrosting

Autor: Gerace, Federica, Doimo, Diego, Mannelli, Stefano Sarao, Saglietti, Luca, Laio, Alessandro

Transfer learning is a powerful tool enabling model training with limited amounts of data. This technique is particularly useful in real-world problems where data availability is often a serious limitation. The simplest transfer learning protocol is

Externí odkaz: http://arxiv.org/abs/2303.01429

Zobrazit plný text záznamu

Report

Bias-inducing geometries: an exactly solvable data model with fairness implications

Autor: Mannelli, Stefano Sarao, Gerace, Federica, Rostamzadeh, Negar, Saglietti, Luca

Machine learning (ML) may be oblivious to human bias but it is not immune to its perpetuation. Marginalisation and iniquitous group representation are often traceable in the very data used for training, and may be reflected or even enhanced by the le

Externí odkaz: http://arxiv.org/abs/2205.15935

Zobrazit plný text záznamu

Report

Maslow's Hammer for Catastrophic Forgetting: Node Re-Use vs Node Activation

Autor: Lee, Sebastian, Mannelli, Stefano Sarao, Clopath, Claudia, Goldt, Sebastian, Saxe, Andrew

Publikováno v: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:12455-12477 (2022)

Continual learning - learning new tasks in sequence while maintaining performance on old tasks - remains particularly challenging for artificial neural networks. Surprisingly, the amount of forgetting does not increase with the dissimilarity between

Externí odkaz: http://arxiv.org/abs/2205.09029

Zobrazit plný text záznamu

Report

An Analytical Theory of Curriculum Learning in Teacher-Student Networks

Autor: Saglietti, Luca, Mannelli, Stefano Sarao, Saxe, Andrew

In humans and animals, curriculum learning -- presenting data in a curated order - is critical to rapid learning and effective pedagogy. Yet in machine learning, curricula are not widely used and empirically often yield only moderate benefits. This s

Externí odkaz: http://arxiv.org/abs/2106.08068

Zobrazit plný text záznamu

Report

Probing transfer learning with a model of synthetic correlated datasets

Autor: Gerace, Federica, Saglietti, Luca, Mannelli, Stefano Sarao, Saxe, Andrew, Zdeborová, Lenka

Publikováno v: Machine Learning: Science and Technology 3.1 (2022): 015030

Transfer learning can significantly improve the sample efficiency of neural networks, by exploiting the relatedness between a data-scarce target task and a data-abundant source task. Despite years of successful applications, transfer learning practic

Externí odkaz: http://arxiv.org/abs/2106.05418

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání