Showing 1 - 10 of 311 for search: '"Rabbat, Michael"'
Vision-language models enable open-world classification of objects without the need for any retraining. While this zero-shot paradigm marks a significant advance, even today's best models exhibit skewed performance when objects are dissimilar from… (a toy sketch of the zero-shot setup follows the link below)
External link:
http://arxiv.org/abs/2404.16717
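The entry above describes open-world, zero-shot classification with a vision-language model: class names are turned into text prompts, embedded, and compared against the image embedding, so new classes need no retraining. Below is a minimal sketch of that setup, assuming a CLIP-style joint embedding space; the two embedding functions are hypothetical placeholders, and the prompt template is an illustrative convention, not the paper's.

```python
# Minimal sketch of zero-shot classification in a joint image-text
# embedding space. embed_image / embed_text are hypothetical stand-ins
# for a real vision-language model such as CLIP.
import numpy as np

def embed_image(image: np.ndarray) -> np.ndarray:
    # Placeholder: a real model would return a learned image embedding.
    return np.random.default_rng(0).normal(size=512)

def embed_text(prompt: str) -> np.ndarray:
    # Placeholder: a real model would return a learned text embedding.
    seed = abs(hash(prompt)) % (2**32)
    return np.random.default_rng(seed).normal(size=512)

def zero_shot_classify(image: np.ndarray, class_names: list[str]) -> str:
    # Embed the image once; adding a class is just adding a prompt,
    # which is why no retraining is needed.
    img = embed_image(image)
    img /= np.linalg.norm(img)
    scores = []
    for name in class_names:
        txt = embed_text(f"a photo of a {name}")  # illustrative template
        txt /= np.linalg.norm(txt)
        scores.append(float(img @ txt))           # cosine similarity
    return class_names[int(np.argmax(scores))]

print(zero_shot_classify(np.zeros((224, 224, 3)), ["cat", "dog", "truck"]))
```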
Author:
Lehnert, Lucas, Sukhbaatar, Sainbayar, Su, DiJia, Zheng, Qinqing, Mcvay, Paul, Rabbat, Michael, Tian, Yuandong
While Transformers have enabled tremendous progress in various application settings, such architectures still trail behind traditional symbolic planners for solving complex decision making tasks. In this work, we demonstrate how to train Transformers…
External link:
http://arxiv.org/abs/2402.14083
Author:
Bardes, Adrien, Garrido, Quentin, Ponce, Jean, Chen, Xinlei, Rabbat, Michael, LeCun, Yann, Assran, Mahmoud, Ballas, Nicolas
This paper explores feature prediction as a stand-alone objective for unsupervised learning from video and introduces V-JEPA, a collection of vision models trained solely using a feature prediction objective, without the use of pretrained image encoders…
External link:
http://arxiv.org/abs/2404.08471
Author:
Shi, Hao-Jun Michael, Lee, Tsung-Hsien, Iwasaki, Shintaro, Gallego-Posada, Jose, Li, Zhijing, Rangadurai, Kaushik, Mudigere, Dheevatsa, Rabbat, Michael
Shampoo is an online and stochastic optimization algorithm belonging to the AdaGrad family of methods for training neural networks. It constructs a block-diagonal preconditioner where each block consists of a coarse Kronecker product approximation to full-matrix AdaGrad… (a compact sketch of this preconditioner follows the link below)
External link:
http://arxiv.org/abs/2309.06497
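The Shampoo entry above describes a block-diagonal preconditioner whose blocks are coarse Kronecker-product approximations to full-matrix AdaGrad. The sketch below shows that update for a single matrix-shaped parameter: two factor statistics, one per axis, are accumulated from gradient outer products, and their inverse fourth roots precondition the gradient. Random gradients stand in for a real training loop, and the step size and epsilon are illustrative, not the paper's tuned defaults.

```python
# Sketch of the Shampoo update for one matrix parameter block:
# the Kronecker product of inv_root(L, 4) and inv_root(R, 4) acts like
# an inverse square root of the full second-moment statistic.
import numpy as np

def inv_root(M: np.ndarray, p: int, eps: float = 1e-6) -> np.ndarray:
    # Inverse p-th root of a symmetric PSD matrix via eigendecomposition.
    w, V = np.linalg.eigh(M)
    return (V * np.clip(w, eps, None) ** (-1.0 / p)) @ V.T

rng = np.random.default_rng(0)
W = rng.normal(size=(8, 4))   # one parameter block
L = np.zeros((8, 8))          # left (row-space) factor statistic
R = np.zeros((4, 4))          # right (column-space) factor statistic

for step in range(100):
    G = rng.normal(size=W.shape)  # stand-in for a stochastic gradient
    L += G @ G.T                  # accumulate per-axis second moments
    R += G.T @ G
    W -= 0.01 * inv_root(L, 4) @ G @ inv_root(R, 4)

print(np.linalg.norm(W))
```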
Author:
Oquab, Maxime, Darcet, Timothée, Moutakanni, Théo, Vo, Huy, Szafraniec, Marc, Khalidov, Vasil, Fernandez, Pierre, Haziza, Daniel, Massa, Francisco, El-Nouby, Alaaeldin, Assran, Mahmoud, Ballas, Nicolas, Galuba, Wojciech, Howes, Russell, Huang, Po-Yao, Li, Shang-Wen, Misra, Ishan, Rabbat, Michael, Sharma, Vasu, Synnaeve, Gabriel, Xu, Hu, Jegou, Hervé, Mairal, Julien, Labatut, Patrick, Joulin, Armand, Bojanowski, Piotr
The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision. These models could greatly simplify the use of images in any system by producing all-purpose visual features…
External link:
http://arxiv.org/abs/2304.07193
Author:
Yousefpour, Ashkan, Guo, Shen, Shenoy, Ashish, Ghosh, Sayan, Stock, Pierre, Maeng, Kiwan, Krüger, Schalk-Willem, Rabbat, Michael, Wu, Carole-Jean, Mironov, Ilya
The rapid progress of AI is fueled by increasingly large and computationally intensive machine learning models and datasets. As a consequence, the amount of compute used in training state-of-the-art models is exponentially increasing (doubling every…)
External link:
http://arxiv.org/abs/2303.14604
Author:
Assran, Mahmoud, Duval, Quentin, Misra, Ishan, Bojanowski, Piotr, Vincent, Pascal, Rabbat, Michael, LeCun, Yann, Ballas, Nicolas
This paper demonstrates an approach for learning highly semantic image representations without relying on hand-crafted data-augmentations. We introduce the Image-based Joint-Embedding Predictive Architecture (I-JEPA), a non-generative approach for self-supervised learning from images… (the predictive objective is sketched after the link below)
External link:
http://arxiv.org/abs/2301.08243
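The I-JEPA entry above describes a non-generative objective: predict the representations of masked target blocks from a visible context block, with the loss computed in representation space rather than pixel space. The toy sketch below exists only to make the shapes and the loss concrete; the linear encoders, mean-pooled predictor, and patch indices are all hypothetical simplifications.

```python
# Toy sketch of a joint-embedding predictive objective: predict target
# representations from context representations; nothing is reconstructed
# in pixel space. Encoders here are hypothetical linear maps.
import numpy as np

rng = np.random.default_rng(0)
D_PATCH, D_REPR, N_PATCHES = 64, 32, 16

context_encoder = rng.normal(scale=0.1, size=(D_PATCH, D_REPR))
target_encoder = context_encoder.copy()  # in practice an EMA copy, held fixed
predictor = rng.normal(scale=0.1, size=(D_REPR, D_REPR))

patches = rng.normal(size=(N_PATCHES, D_PATCH))  # patchified image
target_idx = [3, 7, 11]                          # masked target block
context_idx = [i for i in range(N_PATCHES) if i not in target_idx]

ctx_repr = patches[context_idx] @ context_encoder
pred = ctx_repr.mean(axis=0) @ predictor         # crude pooled predictor
tgt_repr = (patches[target_idx] @ target_encoder).mean(axis=0)  # fixed targets
loss = np.mean((pred - tgt_repr) ** 2)           # L2 in representation space
print(f"representation-space loss: {loss:.4f}")
```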
Author:
Wortsman, Mitchell, Gururangan, Suchin, Li, Shen, Farhadi, Ali, Schmidt, Ludwig, Rabbat, Michael, Morcos, Ari S.
When fine-tuning large neural networks, it is common to use multiple nodes and to communicate gradients at each optimization step. By contrast, we investigate completely local fine-tuning, which we refer to as lo-fi. During lo-fi, each node is fine-tuned… (the two regimes are contrasted in the sketch below)
External link:
http://arxiv.org/abs/2210.11948
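The lo-fi entry above contrasts standard distributed fine-tuning, which averages gradients across nodes at every step, with completely local fine-tuning. The sketch below runs the local regime on toy linear regressors; merging the independent runs by a single weight average at the end is an assumption about one natural way to combine them, not necessarily the paper's exact recipe.

```python
# Sketch of communication-free ("lo-fi"-style) fine-tuning: each node
# fine-tunes the same pretrained weights on its own shard, with no
# gradient communication, then the weights are merged once at the end.
import numpy as np

def local_finetune(w0, X, y, lr=0.1, steps=50):
    w = w0.copy()
    for _ in range(steps):
        w -= lr * X.T @ (X @ w - y) / len(y)  # plain least-squares GD
    return w

rng = np.random.default_rng(0)
w_pretrained = rng.normal(size=5)
shards = [(rng.normal(size=(32, 5)), rng.normal(size=32)) for _ in range(4)]

local_models = [local_finetune(w_pretrained, X, y) for X, y in shards]
w_merged = np.mean(local_models, axis=0)  # assumed merge: simple averaging
print(w_merged)
```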
An oft-cited challenge of federated learning is the presence of heterogeneity. Data heterogeneity refers to the fact that data from different clients may follow very different distributions. System heterogeneity refers to the fact that… (both kinds are made concrete in the sketch below)
External link:
http://arxiv.org/abs/2210.08090
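To make the two definitions above concrete, here is a minimal FedAvg-style round in which each client samples from its own distribution (data heterogeneity) and completes a different number of local steps (system heterogeneity, e.g. faster versus slower devices). FedAvg itself is the standard algorithm; the client setup is purely illustrative.

```python
# Minimal FedAvg loop with heterogeneous clients: per-client data
# distributions (data heterogeneity) and per-client local step counts
# (system heterogeneity). All values are illustrative.
import numpy as np

rng = np.random.default_rng(0)
D = 5
clients = []
for _ in range(4):
    mean = rng.normal(size=D)               # each client's own distribution
    X = rng.normal(loc=mean, size=(64, D))
    y = X @ np.ones(D) + rng.normal(scale=0.1, size=64)
    steps = int(rng.integers(1, 10))        # each client's compute budget
    clients.append((X, y, steps))

w_global = np.zeros(D)
for _ in range(20):                         # communication rounds
    updates = []
    for X, y, steps in clients:
        w = w_global.copy()
        for _ in range(steps):              # heterogeneous local work
            w -= 0.05 * X.T @ (X @ w - y) / len(y)
        updates.append(w)
    w_global = np.mean(updates, axis=0)     # server averages client models
print(w_global)
```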
Author:
Assran, Mahmoud, Balestriero, Randall, Duval, Quentin, Bordes, Florian, Misra, Ishan, Bojanowski, Piotr, Vincent, Pascal, Rabbat, Michael, Ballas, Nicolas
A successful paradigm in representation learning is to perform self-supervised pretraining using tasks based on mini-batch statistics (e.g., SimCLR, VICReg, SwAV, MSN). We show that in the formulation of all these methods is an overlooked prior to learn… (the batch-level coupling is sketched below)
External link:
http://arxiv.org/abs/2210.07277
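The last entry points out that methods such as SimCLR, VICReg, SwAV, and MSN define their pretraining task through mini-batch statistics. The sketch below computes a simplified, one-sided SimCLR-style contrastive loss to show that coupling: each example's positive is scored against every other batch member as a negative, so the objective depends on the composition of the whole mini-batch rather than on examples in isolation.

```python
# Simplified one-sided SimCLR-style (InfoNCE) loss: the normalizer sums
# over the whole batch, so the task is defined by mini-batch statistics
# rather than by individual examples.
import numpy as np

def nt_xent(z1: np.ndarray, z2: np.ndarray, tau: float = 0.5) -> float:
    z1 = z1 / np.linalg.norm(z1, axis=1, keepdims=True)
    z2 = z2 / np.linalg.norm(z2, axis=1, keepdims=True)
    logits = z1 @ z2.T / tau                       # batch-by-batch similarities
    labels = np.arange(len(z1))                    # positives on the diagonal
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-log_probs[labels, labels].mean())

rng = np.random.default_rng(0)
z1, z2 = rng.normal(size=(8, 16)), rng.normal(size=(8, 16))
print(nt_xent(z1, z2))
```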