Showing 1 - 10 of 216 for the search: '"Szlam, Arthur"'
Author:
Douillard, Arthur, Feng, Qixuan, Rusu, Andrei A., Kuncoro, Adhiguna, Donchev, Yani, Chhaparia, Rachita, Gog, Ionel, Ranzato, Marc'Aurelio, Shen, Jiajun, Szlam, Arthur
Progress in machine learning (ML) has been fueled by scaling neural network models. This scaling has been enabled by ever more heroic feats of engineering, necessary for accommodating ML approaches that require high bandwidth communication between de…
External link:
http://arxiv.org/abs/2403.10616
Author:
Douillard, Arthur, Feng, Qixuan, Rusu, Andrei A., Chhaparia, Rachita, Donchev, Yani, Kuncoro, Adhiguna, Ranzato, Marc'Aurelio, Szlam, Arthur, Shen, Jiajun
Large language models (LLMs) have become a critical component in many applications of machine learning. However, standard approaches to training LLMs require a large number of tightly interconnected accelerators, with devices exchanging gradients and o…
External link:
http://arxiv.org/abs/2311.08105
Author:
Lanchantin, Jack, Sukhbaatar, Sainbayar, Synnaeve, Gabriel, Sun, Yuxuan, Srinet, Kavya, Szlam, Arthur
Recent progress in using machine learning models for reasoning tasks has been driven by novel model architectures, large-scale pre-training protocols, and dedicated reasoning datasets for fine-tuning. In this work, to further pursue these advances, w…
External link:
http://arxiv.org/abs/2309.07974
Author:
Mohanty, Shrestha, Arabzadeh, Negar, Kiseleva, Julia, Zholus, Artem, Teruel, Milagro, Awadallah, Ahmed, Sun, Yuxuan, Srinet, Kavya, Szlam, Arthur
Human intelligence's adaptability is remarkable, allowing us to adjust to new tasks and multi-modal environments swiftly. This skill is evident from a young age as we acquire new abilities and solve problems by imitating others or following natural l…
External link:
http://arxiv.org/abs/2305.10783
Large language models have been shown to struggle with multi-step reasoning, and do not retain previous reasoning steps for future use. We propose a simple method for solving both of these problems by allowing the model to take Self-Notes. Unlike rec…
External link:
http://arxiv.org/abs/2305.00833
Current dialogue research primarily studies pairwise (two-party) conversations, and does not address the everyday setting where more than two speakers converse together. In this work, we both collect and evaluate multi-party conversations to study th…
External link:
http://arxiv.org/abs/2304.13835
While language models have become more capable of producing compelling language, we find there are still gaps in maintaining consistency, especially when describing events in a dynamically changing world. We study the setting of generating narratives…
External link:
http://arxiv.org/abs/2301.05746
Author:
Mohanty, Shrestha, Arabzadeh, Negar, Teruel, Milagro, Sun, Yuxuan, Zholus, Artem, Skrynnik, Alexey, Burtsev, Mikhail, Srinet, Kavya, Panov, Aleksandr, Szlam, Arthur, Côté, Marc-Alexandre, Kiseleva, Julia
Published in:
Interactive Learning for Natural Language Processing NeurIPS 2022 Workshop
Human intelligence adapts remarkably quickly to new tasks and environments. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural lang…
External link:
http://arxiv.org/abs/2211.06552
Author:
Shafiullah, Nur Muhammad Mahi, Paxton, Chris, Pinto, Lerrel, Chintala, Soumith, Szlam, Arthur
We propose CLIP-Fields, an implicit scene model that can be used for a variety of tasks, such as segmentation, instance identification, semantic search over space, and view localization. CLIP-Fields learns a mapping from spatial locations to semantic…
External link:
http://arxiv.org/abs/2210.05663
Author:
Shuster, Kurt, Xu, Jing, Komeili, Mojtaba, Ju, Da, Smith, Eric Michael, Roller, Stephen, Ung, Megan, Chen, Moya, Arora, Kushal, Lane, Joshua, Behrooz, Morteza, Ngan, William, Poff, Spencer, Goyal, Naman, Szlam, Arthur, Boureau, Y-Lan, Kambadur, Melanie, Weston, Jason
We present BlenderBot 3, a 175B parameter dialogue model capable of open-domain conversation with access to the internet and a long-term memory, and having been trained on a large number of user defined tasks. We release both the model weights and co…
External link:
http://arxiv.org/abs/2208.03188