Zobrazeno 1 - 10
of 22 544
pro vyhledávání: '"global optimality"'
Autor:
Aarset, Christian
We investigate optimality conditions for the sensor placement problem within optimal experimental design, wherein one must decide on the optimal manner in which a fixed number of sensors can be arranged over a large number of candidate locations. By
Externí odkaz:
http://arxiv.org/abs/2410.16590
In recent years, many estimation problems in robotics have been shown to be solvable to global optimality using their semidefinite relaxations. However, the runtime complexity of off-the-shelf semidefinite programming (SDP) solvers is up to cubic in
Externí odkaz:
http://arxiv.org/abs/2406.02365
Autor:
Xiao, Quan, Chen, Tianyi
Bilevel optimization has witnessed a resurgence of interest, driven by its critical role in trustworthy and efficient machine learning applications. Recent research has focused on proposing efficient methods with provable convergence guarantees. Howe
Externí odkaz:
http://arxiv.org/abs/2408.16087
The Moving Sofa Problem, formally proposed by Leo Moser in 1966, seeks to determine the largest area of a two-dimensional shape that can navigate through an $L$-shaped corridor with unit width. The current best lower bound is about 2.2195, achieved b
Externí odkaz:
http://arxiv.org/abs/2407.11106
Autor:
Patel, Bhrij, Suttle, Wesley A., Koppel, Alec, Aggarwal, Vaneet, Sadler, Brian M., Bedi, Amrit Singh, Manocha, Dinesh
In the context of average-reward reinforcement learning, the requirement for oracle knowledge of the mixing time, a measure of the duration a Markov chain under a fixed policy needs to achieve its stationary distribution, poses a significant challeng
Externí odkaz:
http://arxiv.org/abs/2403.11925
Autor:
Orbanz, Peter
Consider a convex function that is invariant under an group of transformations. If it has a minimizer, does it also have an invariant minimizer? Variants of this problem appear in nonparametric statistics and in a number of adjacent fields. The answe
Externí odkaz:
http://arxiv.org/abs/2402.07613
Direct policy search has achieved great empirical success in reinforcement learning. Many recent studies have revisited its theoretical foundation for continuous control, which reveals elegant nonconvex geometry in various benchmark problems, especia
Externí odkaz:
http://arxiv.org/abs/2312.15332
Proximal Policy Optimization algorithm employing a clipped surrogate objective (PPO-Clip) is a prominent exemplar of the policy optimization methods. However, despite its remarkable empirical success, PPO-Clip lacks theoretical substantiation to date
Externí odkaz:
http://arxiv.org/abs/2312.12065
Reconstruction of interaction network between random events is a critical problem arising from statistical physics and politics to sociology, biology, and psychology, and beyond. The Ising model lays the foundation for this reconstruction process, bu
Externí odkaz:
http://arxiv.org/abs/2310.09257
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.