Showing 1 - 10 of 54 for search: "Woodworth, Blake"
We present an algorithm for minimizing an objective with hard-to-compute gradients by using a related, easier-to-access function as a proxy. Our algorithm is based on approximate proximal point iterations on the proxy combined with relatively few stochastic …
External link:
http://arxiv.org/abs/2302.03542
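The entry above describes approximate proximal point iterations on a cheap proxy, corrected by occasional gradients of the true objective. The sketch below only illustrates that general pattern under toy assumptions (quadratic f and h, hand-picked step sizes and iteration counts); it is not the paper's exact method.

```python
# Illustrative sketch (assumed setup): prox steps on a cheap proxy h, with one
# expensive gradient of the true objective f per outer iteration.
import numpy as np

def grad_f(x):   # "hard" true gradient (toy quadratic with minimizer at 3)
    return 4.0 * (x - 3.0)

def grad_h(x):   # cheap proxy gradient with similar but not identical curvature
    return 3.5 * (x - 3.0)

def proxy_prox_point(x0, lam=0.5, outer_steps=20, inner_steps=50, lr=0.05):
    x = x0
    for _ in range(outer_steps):
        # One expensive gradient of f per outer iteration.
        correction = grad_f(x) - grad_h(x)
        # Approximately solve the proximal subproblem on the proxy:
        #   min_y  h(y) + <correction, y> + (1/(2*lam)) * ||y - x||^2
        y = x.copy()
        for _ in range(inner_steps):
            g = grad_h(y) + correction + (y - x) / lam
            y = y - lr * g
        x = y
    return x

print(proxy_prox_point(np.zeros(3)))  # should approach the minimizer at 3.0
```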
The existing analysis of asynchronous stochastic gradient descent (SGD) degrades dramatically when any delay is large, giving the impression that performance depends primarily on the delay. On the contrary, we prove much better guarantees for the same …
External link:
http://arxiv.org/abs/2206.07638
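For context on the entry above, here is a small toy of asynchronous SGD in which the server applies gradients computed at stale iterates. The fixed delay, quadratic objective, and step size are assumptions for illustration only, not the paper's setting.

```python
# Toy asynchronous SGD with delayed gradient application.
import numpy as np
from collections import deque

rng = np.random.default_rng(0)

def stochastic_grad(x):
    # Noisy gradient of f(x) = 0.5 * ||x||^2 (toy objective).
    return x + 0.1 * rng.standard_normal(x.shape)

def delayed_sgd(x0, steps=200, max_delay=5, lr=0.05):
    x = x0
    buffer = deque()  # gradients "in flight", applied max_delay steps late
    for _ in range(steps):
        buffer.append(stochastic_grad(x))   # a worker starts computing at the current x
        if len(buffer) > max_delay:
            x = x - lr * buffer.popleft()   # the server applies a stale gradient
    return x

print(np.linalg.norm(delayed_sgd(np.ones(10))))  # should shrink toward 0
```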
We consider potentially non-convex optimization problems, for which optimal rates of approximation depend on the dimension of the parameter space and the smoothness of the function to be optimized. In this paper, we propose an algorithm that achieves …
External link:
http://arxiv.org/abs/2204.04970
We propose and analyze a stochastic Newton algorithm for homogeneous distributed stochastic convex optimization, where each machine can calculate stochastic gradients of the same population objective, as well as stochastic Hessian-vector products …
External link:
http://arxiv.org/abs/2110.02954
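The entry above concerns a stochastic Newton method built on Hessian-vector products. A generic sketch of one such step, solving H d = g by conjugate gradient without ever forming the Hessian, is shown below; the least-squares objective and iteration counts are illustrative assumptions, and this is not the paper's specific distributed algorithm.

```python
# Sketch of a Newton-type step using only Hessian-vector products (assumed toy setup).
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((50, 10))
b = rng.standard_normal(50)

def gradient(x):                    # gradient of 0.5 * ||Ax - b||^2
    return A.T @ (A @ x - b)

def hess_vec(x, v):                 # Hessian-vector product, no explicit Hessian
    return A.T @ (A @ v)

def newton_step(x, cg_iters=25):
    g = gradient(x)
    d = np.zeros_like(x)            # conjugate gradient for H d = g
    r = g - hess_vec(x, d)
    p = r.copy()
    for _ in range(cg_iters):
        if np.sqrt(r @ r) < 1e-10:  # stop once the system is solved
            break
        Hp = hess_vec(x, p)
        alpha = (r @ r) / (p @ Hp)
        d = d + alpha * p
        r_new = r - alpha * Hp
        beta = (r_new @ r_new) / (r @ r)
        p = r_new + beta * p
        r = r_new
    return x - d

x = newton_step(np.zeros(10))
print(np.linalg.norm(gradient(x)))  # near 0 after one (essentially exact) step
```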
Author:
Woodworth, Blake
In this thesis, I study the minimax oracle complexity of distributed stochastic optimization. First, I present the "graph oracle model", an extension of the classic oracle complexity framework that can be applied to study distributed optimization algorithms …
External link:
http://arxiv.org/abs/2109.00534
Author:
Wang, Jianyu, Charles, Zachary, Xu, Zheng, Joshi, Gauri, McMahan, H. Brendan, Arcas, Blaise Aguera y, Al-Shedivat, Maruan, Andrew, Galen, Avestimehr, Salman, Daly, Katharine, Data, Deepesh, Diggavi, Suhas, Eichner, Hubert, Gadhikar, Advait, Garrett, Zachary, Girgis, Antonious M., Hanzely, Filip, Hard, Andrew, He, Chaoyang, Horvath, Samuel, Huo, Zhouyuan, Ingerman, Alex, Jaggi, Martin, Javidi, Tara, Kairouz, Peter, Kale, Satyen, Karimireddy, Sai Praneeth, Konecny, Jakub, Koyejo, Sanmi, Li, Tian, Liu, Luyang, Mohri, Mehryar, Qi, Hang, Reddi, Sashank J., Richtarik, Peter, Singhal, Karan, Smith, Virginia, Soltanolkotabi, Mahdi, Song, Weikang, Suresh, Ananda Theertha, Stich, Sebastian U., Talwalkar, Ameet, Wang, Hongyi, Woodworth, Blake, Wu, Shanshan, Yu, Felix X., Yuan, Honglin, Zaheer, Manzil, Zhang, Mi, Zhang, Tong, Zheng, Chunxiang, Zhu, Chen, Zhu, Wennan
Federated learning and analytics are a distributed approach for collaboratively learning models (or statistics) from decentralized data, motivated by and designed for privacy protection. The distributed learning process can be formulated as solving …
External link:
http://arxiv.org/abs/2107.06917
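As a rough illustration of the federated optimization formulation mentioned above (local training on clients followed by server averaging), a minimal federated-averaging toy might look as follows. The quadratic client objectives and all constants are assumptions, and none of the privacy or systems aspects are modeled.

```python
# Toy federated averaging: local SGD on each client, then averaging on the server.
import numpy as np

rng = np.random.default_rng(2)
# Each "client" holds its own quadratic objective 0.5 * ||x - c_k||^2.
client_centers = [rng.standard_normal(5) for _ in range(8)]

def local_update(x, center, local_steps=10, lr=0.1):
    for _ in range(local_steps):
        x = x - lr * (x - center)            # gradient of the client's local objective
    return x

def fed_avg(rounds=30):
    x = np.zeros(5)
    for _ in range(rounds):
        client_models = [local_update(x.copy(), c) for c in client_centers]
        x = np.mean(client_models, axis=0)   # server aggregates by averaging
    return x

print(fed_avg() - np.mean(client_centers, axis=0))  # should be near zero
```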
Author:
Woodworth, Blake, Srebro, Nathan
We present and analyze an algorithm for optimizing smooth and convex or strongly convex objectives using minibatch stochastic gradient estimates. The algorithm is optimal with respect to its dependence on both the minibatch size and minimum expected …
External link:
http://arxiv.org/abs/2106.02720
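For reference alongside the entry above, a plain minibatch SGD baseline on a least-squares problem is sketched below. It is not the optimal algorithm analyzed in the paper, just the standard minibatch stochastic gradient scheme, with assumed data and constants.

```python
# Plain minibatch SGD on a synthetic least-squares problem (assumed setup).
import numpy as np

rng = np.random.default_rng(3)
X = rng.standard_normal((1000, 20))
w_true = rng.standard_normal(20)
y = X @ w_true + 0.05 * rng.standard_normal(1000)

def minibatch_sgd(batch_size=64, steps=500, lr=0.01):
    w = np.zeros(20)
    for _ in range(steps):
        idx = rng.choice(len(X), size=batch_size, replace=False)
        grad = X[idx].T @ (X[idx] @ w - y[idx]) / batch_size   # minibatch gradient
        w = w - lr * grad
    return w

print(np.linalg.norm(minibatch_sgd() - w_true))  # small residual error
```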
Author:
Azulay, Shahar, Moroshko, Edward, Nacson, Mor Shpigel, Woodworth, Blake, Srebro, Nathan, Globerson, Amir, Soudry, Daniel
Recent work has highlighted the role of initialization scale in determining the structure of the solutions that gradient methods converge to. In particular, it was shown that large initialization leads to the neural tangent kernel regime solution, whereas …
External link:
http://arxiv.org/abs/2102.09769
The Min-Max Complexity of Distributed Stochastic Convex Optimization with Intermittent Communication
We resolve the min-max complexity of distributed stochastic convex optimization (up to a log factor) in the intermittent communication setting, where $M$ machines work in parallel over the course of $R$ rounds of communication to optimize the objective …
External link:
http://arxiv.org/abs/2102.01583
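The intermittent communication setting referenced above can be mimicked with a small local-SGD toy: M machines each take K local stochastic gradient steps per round and synchronize only at the R communication rounds. The objective and all constants below are illustrative assumptions.

```python
# Toy intermittent-communication run: M machines, R rounds, K local steps per round.
import numpy as np

rng = np.random.default_rng(4)
M, R, K, lr = 4, 50, 10, 0.05

def stochastic_grad(x):
    return x + 0.1 * rng.standard_normal(x.shape)   # noisy gradient of 0.5 * ||x||^2

x = np.ones(5)
for _ in range(R):                       # R communication rounds
    local_models = []
    for _ in range(M):                   # M machines work in parallel
        xm = x.copy()
        for _ in range(K):               # K local steps between communications
            xm = xm - lr * stochastic_grad(xm)
        local_models.append(xm)
    x = np.mean(local_models, axis=0)    # synchronize by averaging
print(np.linalg.norm(x))                 # should be near the noise floor
```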
Author:
Moroshko, Edward, Gunasekar, Suriya, Woodworth, Blake, Lee, Jason D., Srebro, Nathan, Soudry, Daniel
We provide a detailed asymptotic study of gradient flow trajectories and their implicit optimization bias when minimizing the exponential loss over "diagonal linear networks". This is the simplest model displaying a transition between "kernel" and non-kernel …
External link:
http://arxiv.org/abs/2007.06738
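The entry above studies gradient flow on diagonal linear networks. A rough numerical sketch of that object, gradient descent on a predictor parametrized as w = u*u - v*v trained with the exponential loss, appears below; the data, learning rate, and initialization scales are assumptions chosen only to hint at how initialization scale can change the learned solution.

```python
# Gradient descent on a diagonal linear network w = u*u - v*v with exponential loss.
import numpy as np

rng = np.random.default_rng(5)
n, d = 20, 30
X = rng.standard_normal((n, d))
w_star = np.zeros(d)
w_star[:3] = 1.0                                # sparse "ground truth" direction
y = np.sign(X @ w_star)

def train_diag_net(alpha, steps=50000, lr=0.005):
    # Initialize both factors at scale alpha, so the predictor starts at w = 0.
    u = alpha * np.ones(d)
    v = alpha * np.ones(d)
    for _ in range(steps):
        w = u * u - v * v
        margins = y * (X @ w)
        # Gradient of the mean exponential loss with respect to w.
        grad_w = -(X * (y * np.exp(-margins))[:, None]).mean(axis=0)
        u -= lr * (2 * u) * grad_w              # chain rule through w = u^2 - v^2
        v -= lr * (-2 * v) * grad_w
    return u * u - v * v

for alpha in (0.01, 1.0):                       # small vs. large initialization scale
    w = train_diag_net(alpha)
    frac = np.abs(w[:3]).sum() / np.abs(w).sum()
    print(f"alpha={alpha}: weight fraction on informative coordinates = {frac:.2f}")
```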