Výsledky vyhledávání - "Sabach, Shoham"

Report

A Fast Algorithm for Convex Composite Bi-Level Optimization

Autor: Merchav, Roey, Sabach, Shoham, Teboulle, Marc

In this paper, we study convex bi-level optimization problems where both the inner and outer levels are given as a composite convex minimization. We propose the Fast Bi-level Proximal Gradient (FBi-PG) algorithm, which can be interpreted as applying

Externí odkaz: http://arxiv.org/abs/2407.21221

Zobrazit plný text záznamu

Report

Learning the Target Network in Function Space

Autor: Asadi, Kavosh, Liu, Yao, Sabach, Shoham, Yin, Ming, Fakoor, Rasool

We focus on the task of learning the value function in the reinforcement learning (RL) setting. This task is often solved by updating a pair of online and target networks while ensuring that the parameters of these two networks are equivalent. We pro

Externí odkaz: http://arxiv.org/abs/2406.01838

Zobrazit plný text záznamu

Akademický článek

Non-Convex Split Feasibility Problems: Models, Algorithms and Theory

Autor: Gibali, Aviv, Sabach, Shoham, Voldman, Sergey

Publikováno v: Open Journal of Mathematical Optimization, Vol 1, Iss , Pp 1-15 (2020)

In this paper, we propose a catalog of iterative methods for solving the Split Feasibility Problem in the non-convex setting. We study four different optimization formulations of the problem, where each model has advantages in different settings of t

Externí odkaz: https://doaj.org/article/b1b1fdde862241498a917bc66ed647f1

Zobrazit plný text záznamu

Report

MADA: Meta-Adaptive Optimizers through hyper-gradient Descent

Autor: Ozkara, Kaan, Karakus, Can, Raman, Parameswaran, Hong, Mingyi, Sabach, Shoham, Kveton, Branislav, Cevher, Volkan

Following the introduction of Adam, several novel adaptive optimizers for deep learning have been proposed. These optimizers typically excel in some tasks but may not outperform Adam uniformly across all tasks. In this work, we introduce Meta-Adaptiv

Externí odkaz: http://arxiv.org/abs/2401.08893

Zobrazit plný text záznamu

Report

Krylov Cubic Regularized Newton: A Subspace Second-Order Method with Dimension-Free Convergence Rate

Autor: Jiang, Ruichen, Raman, Parameswaran, Sabach, Shoham, Mokhtari, Aryan, Hong, Mingyi, Cevher, Volkan

Second-order optimization methods, such as cubic regularized Newton methods, are known for their rapid convergence rates; nevertheless, they become impractical in high-dimensional problems due to their substantial memory requirements and computationa

Externí odkaz: http://arxiv.org/abs/2401.03058

Zobrazit plný text záznamu

Report

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models

Autor: Liu, Zuxin, Zhang, Jesse, Asadi, Kavosh, Liu, Yao, Zhao, Ding, Sabach, Shoham, Fakoor, Rasool

The full potential of large pretrained models remains largely untapped in control domains like robotics. This is mainly because of the scarcity of data and the computational challenges associated with training or fine-tuning these large models for su

Externí odkaz: http://arxiv.org/abs/2310.05905

Zobrazit plný text záznamu

Report

Convex Bi-Level Optimization Problems with Non-smooth Outer Objective Function

Autor: Merchav, Roey, Sabach, Shoham

In this paper, we propose the Bi-Sub-Gradient (Bi-SG) method, which is a generalization of the classical sub-gradient method to the setting of convex bi-level optimization problems. This is a first-order method that is very easy to implement in the s

Externí odkaz: http://arxiv.org/abs/2307.08245

Zobrazit plný text záznamu

Report

Resetting the Optimizer in Deep RL: An Empirical Study

Autor: Asadi, Kavosh, Fakoor, Rasool, Sabach, Shoham

We focus on the task of approximating the optimal value function in deep reinforcement learning. This iterative process is comprised of solving a sequence of optimization problems where the loss function changes per iteration. The common approach to

Externí odkaz: http://arxiv.org/abs/2306.17833

Zobrazit plný text záznamu

Report

TD Convergence: An Optimization Perspective

Autor: Asadi, Kavosh, Sabach, Shoham, Liu, Yao, Gottesman, Omer, Fakoor, Rasool

We study the convergence behavior of the celebrated temporal-difference (TD) learning algorithm. By looking at the algorithm through the lens of optimization, we first argue that TD can be viewed as an iterative optimization algorithm where the funct

Externí odkaz: http://arxiv.org/abs/2306.17750

Zobrazit plný text záznamu

Report

Faster Projection-Free Augmented Lagrangian Methods via Weak Proximal Oracle

Autor: Garber, Dan, Livney, Tsur, Sabach, Shoham

This paper considers a convex composite optimization problem with affine constraints, which includes problems that take the form of minimizing a smooth convex objective function over the intersection of (simple) convex sets, or regularized with multi

Externí odkaz: http://arxiv.org/abs/2210.13968

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání