Výsledky vyhledávání - "Velarde, Pedro"

Report

Large Language Models can Achieve Social Balance

Autor: Cisneros-Velarde, Pedro

Social balance is a concept in sociology which states that if every three individuals in a population achieve certain structures of positive or negative interactions, then the whole population ends up in one faction of positive interactions or divide

Externí odkaz: http://arxiv.org/abs/2410.04054

Zobrazit plný text záznamu

Report

Optimization and Generalization Guarantees for Weight Normalization

Autor: Cisneros-Velarde, Pedro, Chen, Zhijie, Koyejo, Sanmi, Banerjee, Arindam

Weight normalization (WeightNorm) is widely used in practice for the training of deep neural networks and modern deep learning libraries have built-in implementations of it. In this paper, we provide the first theoretical characterizations of both op

Externí odkaz: http://arxiv.org/abs/2409.08935

Zobrazit plný text záznamu

Report

On the Principles behind Opinion Dynamics in Multi-Agent Systems of Large Language Models

Autor: Cisneros-Velarde, Pedro

We study the evolution of opinions inside a population of interacting large language models (LLMs). Every LLM needs to decide how much funding to allocate to an item with three initial possibilities: full, partial, or no funding. We identify biases t

Externí odkaz: http://arxiv.org/abs/2406.15492

Zobrazit plný text záznamu

Report

Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation

Autor: Cisneros-Velarde, Pedro, Koyejo, Sanmi

Nash Q-learning may be considered one of the first and most known algorithms in multi-agent reinforcement learning (MARL) for learning policies that constitute a Nash equilibrium of an underlying general-sum Markov game. Its original proof provided a

Externí odkaz: http://arxiv.org/abs/2303.00177

Zobrazit plný text záznamu

Report

Restricted Strong Convexity of Deep Learning Models with Smooth Activations

Autor: Banerjee, Arindam, Cisneros-Velarde, Pedro, Zhu, Libin, Belkin, Mikhail

We consider the problem of optimization of deep learning models with smooth activation functions. While there exist influential results on the problem from the ``near initialization'' perspective, we shed considerable new light on the problem. In par

Externí odkaz: http://arxiv.org/abs/2209.15106

Zobrazit plný text záznamu

Report

Non-thermal evolution of dense plasmas driven by intense x-ray fields

Autor: Ren, Shenyuan, Shi, Yuanfeng, Berg, Quincy Y. van den, Firmansyah, Muhammad, Chung, Hyun-Kyung, Fernandez-Tello, Elisa V., Velarde, Pedro, Wark, Justin S., Vinko, Sam M.

The advent of x-ray free-electron lasers (XFELs) has enabled a range of new experimental investigations into the properties of matter driven to extreme conditions via intense x-ray-matter interactions. The femtosecond timescales of these interactions

Externí odkaz: http://arxiv.org/abs/2208.00573

Zobrazit plný text záznamu

Report

Discrete State-Action Abstraction via the Successor Representation

Autor: Attali, Amnon, Cisneros-Velarde, Pedro, Morales, Marco, Amato, Nancy M.

While the difficulty of reinforcement learning problems is typically related to the complexity of their state spaces, Abstraction proposes that solutions often lie in simpler underlying latent spaces. Prior works have focused on learning either a con

Externí odkaz: http://arxiv.org/abs/2206.03467

Zobrazit plný text záznamu

Report

One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning

Autor: Cisneros-Velarde, Pedro, Lyu, Boxiang, Koyejo, Sanmi, Kolar, Mladen

Although parallelism has been extensively used in reinforcement learning (RL), the quantitative effects of parallel exploration are not well understood theoretically. We study the benefits of simple parallel exploration for reward-free RL in linear M

Externí odkaz: http://arxiv.org/abs/2205.15891

Zobrazit plný text záznamu

Report

Family-wise error rate control in Gaussian graphical model selection via Distributionally Robust Optimization

Autor: Tran, Chau, Cisneros-Velarde, Pedro, Oh, Sang-Yun, Petersen, Alexander

Recently, a special case of precision matrix estimation based on a distributionally robust optimization (DRO) framework has been shown to be equivalent to the graphical lasso. From this formulation, a method for choosing the regularization term, i.e.

Externí odkaz: http://arxiv.org/abs/2201.12441

Zobrazit plný text záznamu

Report

From Contraction Theory to Fixed Point Algorithms on Riemannian and Non-Euclidean Spaces

Autor: Bullo, Francesco, Cisneros-Velarde, Pedro, Davydov, Alexander, Jafarpour, Saber

The design of fixed point algorithms is at the heart of monotone operator theory, convex analysis, and of many modern optimization problems arising in machine learning and control. This tutorial reviews recent advances in understanding the relationsh

Externí odkaz: http://arxiv.org/abs/2110.03623

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání