Showing 1 - 10 of 30 for search: '"Khona, Mikail"'
Frontier AI systems are making transformative impacts across society, but such benefits are not without costs: models trained on web-scale datasets containing personal and private data raise profound concerns about data privacy and security. Language…
External link:
http://arxiv.org/abs/2406.14549
In-context learning is a powerful capability of certain machine learning models that arguably underpins the success of today's frontier AI models. However, in-context learning is critically limited to settings where the in-context distribution of int…
External link:
http://arxiv.org/abs/2406.12785
Author:
Schaeffer, Rylan, Lecomte, Victor, Pai, Dhruv Bhandarkar, Carranza, Andres, Isik, Berivan, Unell, Alyssa, Khona, Mikail, Yerxa, Thomas, LeCun, Yann, Chung, SueYeon, Gromov, Andrey, Shwartz-Ziv, Ravid, Koyejo, Sanmi
Maximum Manifold Capacity Representations (MMCR) is a recent multi-view self-supervised learning (MVSSL) method that matches or surpasses other leading MVSSL methods. MMCR is intriguing because it does not fit neatly into any of the commonplace MVSSL…
External link:
http://arxiv.org/abs/2406.09366
Author:
Luo, Xiaoliang, Rechardt, Akilles, Sun, Guangzhi, Nejad, Kevin K., Yáñez, Felipe, Yilmaz, Bati, Lee, Kangjoo, Cohen, Alexandra O., Borghesani, Valentina, Pashkov, Anton, Marinazzo, Daniele, Nicholas, Jonathan, Salatiello, Alessandro, Sucholutsky, Ilia, Minervini, Pasquale, Razavi, Sepehr, Rocca, Roberta, Yusifov, Elkhan, Okalova, Tereza, Gu, Nianlong, Ferianc, Martin, Khona, Mikail, Patil, Kaustubh R., Lee, Pui-Shee, Mata, Rui, Myers, Nicholas E., Bizley, Jennifer K, Musslick, Sebastian, Bilgin, Isil Poyraz, Niso, Guiomar, Ales, Justin M., Gaebler, Michael, Murty, N Apurva Ratan, Loued-Khenissi, Leyla, Behler, Anna, Hall, Chloe M., Dafflon, Jessica, Bao, Sherry Dongqi, Love, Bradley C.
Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could pot…
External link:
http://arxiv.org/abs/2403.03230
Author:
Schaeffer, Rylan, Zahedi, Nika, Khona, Mikail, Pai, Dhruv, Truong, Sang, Du, Yilun, Ostrow, Mitchell, Chandra, Sarthak, Carranza, Andres, Fiete, Ila Rani, Gromov, Andrey, Koyejo, Sanmi
Associative memory and probabilistic modeling are two fundamental topics in artificial intelligence. The first studies recurrent neural networks designed to denoise, complete and retrieve data, whereas the second studies learning and sampling from pr…
External link:
http://arxiv.org/abs/2402.10202
Author:
Khona, Mikail, Okawa, Maya, Hula, Jan, Ramesh, Rahul, Nishi, Kento, Dick, Robert, Lubana, Ekdeep Singh, Tanaka, Hidenori
Stepwise inference protocols, such as scratchpads and chain-of-thought, help language models solve complex problems by decomposing them into a sequence of simpler subproblems. Despite the significant gain in performance achieved via these protocols,…
External link:
http://arxiv.org/abs/2402.07757
Work on deep learning-based models of grid cells suggests that grid cells generically and robustly arise from optimizing networks to path integrate, i.e., track one's spatial position by integrating self-velocity signals. In previous work, we challen…
External link:
http://arxiv.org/abs/2312.03954
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Transformers trained on huge text corpora exhibit a remarkable set of capabilities, e.g., performing basic arithmetic. Given the inherent compositional nature of language, one can expect the model to learn to compose these capabilities, potentially y…
External link:
http://arxiv.org/abs/2311.12997
Author:
Schaeffer, Rylan, Khona, Mikail, Ma, Tzuhsuan, Eyzaguirre, Cristóbal, Koyejo, Sanmi, Fiete, Ila Rani
To solve the spatial problems of mapping, localization and navigation, the mammalian lineage has developed striking spatial representations. One important spatial representation is the Nobel Prize-winning grid cells: neurons that represent self-locat…
External link:
http://arxiv.org/abs/2311.02316
Recurrent neural networks (RNNs) trained on compositional tasks can exhibit functional modularity, in which neurons can be clustered by activity similarity and participation in shared computational subtasks. Unlike brains, these RNNs do not exhibit a…
External link:
http://arxiv.org/abs/2310.07711