PNKH-B: A Projected Newton-Krylov Method for Large-Scale Bound-Constrained Optimization
Autor: | Kelvin K.W. Kan, Lars Ruthotto, Samy Wu Fung |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2020 |
Předmět: |
Optimization problem
Scale (ratio) Estimation theory Applied Mathematics Constrained optimization MathematicsofComputing_NUMERICALANALYSIS 010103 numerical & computational mathematics Function (mathematics) Numerical Analysis (math.NA) 01 natural sciences 010101 applied mathematics Newton–Krylov method Computational Mathematics FOS: Mathematics Applied mathematics Mathematics - Numerical Analysis 0101 mathematics Mathematics |
Popis: | We present PNKH-B, a projected Newton-Krylov method for iteratively solving large-scale optimization problems with bound constraints. PNKH-B is geared toward situations in which function and gradient evaluations are expensive, and the (approximate) Hessian is only available through matrix-vector products. This is commonly the case in large-scale parameter estimation, machine learning, and image processing. In each iteration, PNKH-B uses a low-rank approximation of the (approximate) Hessian to determine the search direction and construct the metric used in a projected line search. The key feature of the metric is its consistency with the low-rank approximation of the Hessian on the Krylov subspace. This renders PNKH-B similar to a projected variable metric method. We present an interior point method to solve the quadratic projection problem efficiently. Since the interior point method effectively exploits the low-rank structure, its computational cost only scales linearly with respect to the number of variables, and it only adds negligible computational time. We also experiment with variants of PNKH-B that incorporate estimates of the active set into the Hessian approximation. We prove the global convergence to a stationary point under standard assumptions. Using three numerical experiments motivated by parameter estimation, machine learning, and image reconstruction, we show that the consistent use of the Hessian metric in PNKH-B leads to fast convergence, particularly in the first few iterations. We provide our MATLAB implementation at https://github.com/EmoryMLIP/PNKH-B. |
Databáze: | OpenAIRE |
Externí odkaz: |