Optimization and Control

2025-05-15 | | Total: 20

#1 Single-loop $\mathcal{O}(ε^{-3})$ stochastic smoothing algorithms for nonsmooth Riemannian optimization [PDF1] [Copy] [Kimi] [REL]

Authors: Kangkang Deng, Zheng Peng, Weihe Wu

In this paper, we develop two Riemannian stochastic smoothing algorithms for nonsmooth optimization problems on Riemannian manifolds, addressing distinct forms of the nonsmooth term \( h \). Both methods combine dynamic smoothing with a momentum-based variance reduction scheme in a fully online manner. When \( h \) is Lipschitz continuous, we propose an stochastic algorithm under adaptive parameter that achieves the optimal iteration complexity of \( \mathcal{O}(\epsilon^{-3}) \), improving upon the best-known rates for exist algorithms. When \( h \) is the indicator function of a convex set, we design a new algorithm using truncated momentum, and under a mild error bound condition with parameter \( \theta \geq 1 \), we establish a complexity of \( \tilde{\mathcal{O}}(\epsilon^{-\max\{\theta+2, 2\theta\}}) \), in line with the best-known results in the Euclidean setting. Both algorithms feature a single-loop design with low per-iteration cost and require only \( \mathcal{O}(1) \) samples per iteration, ensuring that sample and iteration complexities coincide. Our framework encompasses a broad class of problems and recovers or matches optimal complexity guarantees in several important settings, including smooth stochastic Riemannian optimization, composite problems in Euclidean space, and constrained optimization via indicator functions.

Subject: Optimization and Control

Publish: 2025-05-14 15:36:49 UTC


#2 On some applications of the Boundary Control method to spectral estimation and inverse problems [PDF] [Copy] [Kimi] [REL]

Authors: S. A. Avdonin, A. S. Mikhaylov, A. S. Mikhaylov

We consider applications of the Boundary Control (BC) method to generalized spectral estimation problems and to inverse source problems. We derive the equations of the BC method for this problems and show that solvability of this equations crucially depends on the controllability properties of the corresponding dynamical system and properties of corresponding families of exponentials.

Subjects: Optimization and Control , Spectral Theory

Publish: 2025-05-14 14:06:07 UTC


#3 A Learning-Based Inexact ADMM for Solving Quadratic Programs [PDF] [Copy] [Kimi] [REL]

Authors: Xi Gao, Jinxin Xiong, Linxin Yang, Akang Wang, Weiwei Xu, Jiang Xue

Convex quadratic programs (QPs) constitute a fundamental computational primitive across diverse domains including financial optimization, control systems, and machine learning. The alternating direction method of multipliers (ADMM) has emerged as a preferred first-order approach due to its iteration efficiency - exemplified by the state-of-the-art OSQP solver. Machine learning-enhanced optimization algorithms have recently demonstrated significant success in speeding up the solving process. This work introduces a neural-accelerated ADMM variant that replaces exact subproblem solutions with learned approximations through a parameter-efficient Long Short-Term Memory (LSTM) network. We derive convergence guarantees within the inexact ADMM formalism, establishing that our learning-augmented method maintains primal-dual convergence while satisfying residual thresholds. Extensive experimental results demonstrate that our approach achieves superior solution accuracy compared to existing learning-based methods while delivering significant computational speedups of up to $7\times$, $28\times$, and $22\times$ over Gurobi, SCS, and OSQP, respectively. Furthermore, the proposed method outperforms other learning-to-optimize methods in terms of solution quality. Detailed performance analysis confirms near-perfect compliance with the theoretical assumptions, consequently ensuring algorithm convergence.

Subject: Optimization and Control

Publish: 2025-05-14 13:44:05 UTC


#4 Safe Primal-Dual Optimization with a Single Smooth Constraint [PDF] [Copy] [Kimi] [REL]

Authors: Ilnura Usmanova, Kfir Yehuda Levy

This paper addresses the problem of safe optimization under a single smooth constraint, a scenario that arises in diverse real-world applications such as robotics and autonomous navigation. The objective of safe optimization is to solve a black-box minimization problem while strictly adhering to a safety constraint throughout the learning process. Existing methods often suffer from high sample complexity due to their noise sensitivity or poor scalability with number of dimensions, limiting their applicability. We propose a novel primal-dual optimization method that, by carefully adjusting dual step-sizes and constraining primal updates, ensures the safety of both primal and dual sequences throughout the optimization. Our algorithm achieves a convergence rate that significantly surpasses current state-of-the-art techniques. Furthermore, to the best of our knowledge, it is the first primal-dual approach to guarantee safe updates. Simulations corroborate our theoretical findings, demonstrating the practical benefits of our method. We also show how the method can be extended to multiple constraints.

Subject: Optimization and Control

Publish: 2025-05-14 12:54:29 UTC


#5 Stochastic Optimal Control for Systems with Drifts of Bounded Variation: A Maximum Principle Approach [PDF] [Copy] [Kimi] [REL]

Authors: Antoine Marie Bogso, Rhoss Likibi Pellat, Donatien Kuissi Kamdem, Olivier Menoukeu Pamen

In this paper, we study the problem of stochastic optimal control for systems governed by stochastic differential equations (SDEs) with drift coefficients of bounded variation. We establish both necessary and sufficient stochastic maximum principle. To achieve this, we prove the existence and uniqueness of solutions to SDEs with random drifts of bounded variation. We then show that these solutions are Sobolev differentiable with respect to their initial conditions and provide an explicit representation involving integrals with respect to the local time of the state process. We handle the irregular drift, by constructing a sequence of approximating control problems with smooth coefficients. By applying Ekeland's variational principle, we obtain a sequence of adjoint processes, which we then use to derive the maximum principle by taking the limit.

Subjects: Optimization and Control , Probability

Publish: 2025-05-14 11:52:59 UTC


#6 Distributed Stochastic Optimization for Non-Smooth and Weakly Convex Problems under Heavy-Tailed Noise [PDF] [Copy] [Kimi] [REL]

Authors: Jun Hu, Chao Sun, Bo Chen, Jianzheng Wang, Zheming Wang

In existing distributed stochastic optimization studies, it is usually assumed that the gradient noise has a bounded variance. However, recent research shows that the heavy-tailed noise, which allows an unbounded variance, is closer to practical scenarios in many tasks. Under heavy-tailed noise, traditional optimization methods, such as stochastic gradient descent, may have poor performance and even diverge. Thus, it is of great importance to study distributed stochastic optimization algorithms applicable to the heavy-tailed noise scenario. However, most of the existing distributed algorithms under heavy-tailed noise are developed for convex and smooth problems, which limits their applications. This paper proposes a clipping-based distributed stochastic algorithm under heavy-tailed noise that is suitable for non-smooth and weakly convex problems. The convergence of the proposed algorithm is proven, and the conditions on the parameters are given. A numerical experiment is conducted to demonstrate the effectiveness of the proposed algorithm.

Subject: Optimization and Control

Publish: 2025-05-14 10:59:53 UTC


#7 Optimization via First-Order Switching Methods: Skew-Symmetric Dynamics and Optimistic Discretization [PDF] [Copy] [Kimi] [REL]

Authors: Antesh Upadhyay, Sang Bin Moon, Abolfazl Hashemi

Large-scale constrained optimization problems are at the core of many tasks in control, signal processing, and machine learning. Notably, problems with functional constraints arise when, beyond a performance{\nobreakdash-}centric goal (e.g., minimizing the empirical loss), one desires to satisfy other requirements such as robustness, fairness, etc. A simple method for such problems, which remarkably achieves optimal rates for non-smooth, convex, strongly convex, and weakly convex functions under first-order oracle, is Switching Gradient Method (SGM): in each iteration depending on a predetermined constraint violation tolerance, use the gradient of objective or the constraint as the update vector. While the performance of SGM is well-understood for non-smooth functions and in fact matches its unconstrained counterpart, i.e., Gradient Descent (GD), less is formally established about its convergence properties under the smoothness of loss and constraint functions. In this work, we aim to fill this gap. First, we show that SGM may not benefit from faster rates under smoothness, in contrast to improved rates for GD under smoothness. By taking a continuous-time limit perspective, we show the issue is fundamental to SGM's dynamics and not an artifact of our analysis. Our continuous-time limit perspective further provides insights towards alleviating SGM's shortcomings. Notably, we show that leveraging the idea of optimism, a well-explored concept in variational inequalities and min-max optimization, could lead to faster methods. This perspective further enables designing a new class of ``soft'' switching methods, for which we further analyze their iteration complexity under mild assumptions.

Subject: Optimization and Control

Publish: 2025-05-14 05:04:49 UTC


#8 Derivative-free optimization is competitive for aerodynamic design optimization in moderate dimensions [PDF] [Copy] [Kimi] [REL]

Authors: Punya Plaban, Peter Bachman, Ashwin Renganathan

Aerodynamic design optimization is an important problem in aircraft design that depends on the interplay between a numerical optimizer and a high-fidelity flow physics solver. Derivative-based, first and (quasi) second order, optimization techniques are the de facto choice, particularly given the availability of the adjoint method and its ability to efficiently compute gradients at the cost of just one solution of the forward problem. However, implementation of the adjoint method requires careful mathematical treatment, and its sensitivity to changes in mesh quality limits widespread applicability. Derivative-free approaches are often overlooked for large scale optimization, citing their lack of scalability in higher dimensions and/or the lack of practical interest in globally optimal solutions that they often target. However, breaking free from an adjoint solver can be paradigm-shifting in broadening the applicability of aerodynamic design optimization. We provide a systematic benchmarking of a select sample of widely used derivative-based and derivative-free optimization algorithms on the design optimization of three canonical aerodynamic bodies, namely, the NACA0012 and RAE2822 airfoils, and the ONERAM6 wing. Our results demonstrate that derivative-free methods are competitive with derivative-based methods, while outperforming them consistently in the high-dimensional setting. These findings highlight the practical competitiveness of modern derivative-free strategies, offering a scalable and robust alternative for aerodynamic design optimization when adjoint-based gradients are unavailable or unreliable.

Subjects: Optimization and Control , Numerical Analysis

Publish: 2025-05-14 02:51:02 UTC


#9 An accelerated proximal PRS-SQP algorithm with dual ascent-descent procedures for smooth composite optimization [PDF] [Copy] [Kimi] [REL]

Authors: Jiachen Jin, Guodong Ma, Jinbao Jian

Conventional wisdom in composite optimization suggests augmented Lagrangian dual ascent (ALDA) in Peaceman-Rachford splitting (PRS) methods for dual feasibility. However, ALDA may fail when the primal iterate is a local minimum, a stationary point, or a coordinatewise solution of the highly nonconvex augmented Lagrangian function. Splitting sequential quadratic programming (SQP) methods utilize augmented Lagrangian dual descent (ALDD) to directly minimize the primal residual, circumventing the limitations of ALDA and achieving faster convergence in smooth optimization. This paper aims to present a fairly accessible generalization of two contrasting dual updates, ALDA and ALDD, for smooth composite optimization. A key feature of our PRS-SQP algorithm is its dual ascent-descent procedure, which provides a free direction rule for the dual updates and a new insight to explain the counterintuitive convergence behavior. Furthermore, we incorporate a hybrid acceleration technique that combines inertial extrapolation and back substitution to improve convergence. Theoretically, we establish the feasibility for a wider range of acceleration factors than previously known and derive convergence rates within the Kurdyka- Lojasiewicz framework. Numerical experiments validate the effectiveness and stability of the proposed method in various dual-update scenarios.

Subject: Optimization and Control

Publish: 2025-05-14 02:23:48 UTC


#10 Reflected stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems [PDF] [Copy] [Kimi] [REL]

Authors: Lu Liu, Qingmeng Wei

This paper mainly investigates reflected stochastic recursive control problems governed by jump-diffusion dynamics. The system's state evolution is described by a stochastic differential equation driven by both Brownian motion and Poisson random measures, while the recursive cost functional is formulated via the solution process Y of a reflected backward stochastic differential equation driven by the same dual stochastic sources. By establishing the dynamic programming principle, we provide the probabilistic interpretation of an obstacle problem for partial integro-differential equations of Hamilton-Jacobi-Bellman type in the viscosity solution sense through our control problem's value function. Furthermore, the value function is proved to inherit the semi-concavity and joint Lipschitz continuity in state and time coordinates, which play key roles in deriving stochastic verification theorems of control problem within the framework of viscosity solutions. We remark that some restrictions in previous study are eliminated, such as the frozen of the reflected processes in time and state, and the independence of the driver from diffusion variables.

Subject: Optimization and Control

Publish: 2025-05-14 02:11:29 UTC


#11 Solving Reach- and Stabilize-Avoid Problems Using Discounted Reachability [PDF] [Copy] [Kimi] [REL]

Authors: Boyang Li, Zheng Gong, Sylvia Herbert

In this article, we consider the infinite-horizon reach-avoid (RA) and stabilize-avoid (SA) zero-sum game problems for general nonlinear continuous-time systems, where the goal is to find the set of states that can be controlled to reach or stabilize to a target set, without violating constraints even under the worst-case disturbance. Based on the Hamilton-Jacobi reachability method, we address the RA problem by designing a new Lipschitz continuous RA value function, whose zero sublevel set exactly characterizes the RA set. We establish that the associated Bellman backup operator is contractive and that the RA value function is the unique viscosity solution of a Hamilton-Jacobi variational inequality. Finally, we develop a two-step framework for the SA problem by integrating our RA strategies with a recently proposed Robust Control Lyapunov-Value Function, thereby ensuring both target reachability and long-term stability. We numerically verify our RA and SA frameworks on a 3D Dubins car system to demonstrate the efficacy of the proposed approach.

Subjects: Optimization and Control , Robotics , Systems and Control

Publish: 2025-05-14 02:04:57 UTC


#12 The Adaptive Complexity of Finding a Stationary Point [PDF] [Copy] [Kimi] [REL]

Authors: Huanjian Zhou, Andi Han, Akiko Takeda, Masashi Sugiyama

In large-scale applications, such as machine learning, it is desirable to design non-convex optimization algorithms with a high degree of parallelization. In this work, we study the adaptive complexity of finding a stationary point, which is the minimal number of sequential rounds required to achieve stationarity given polynomially many queries executed in parallel at each round. For the high-dimensional case, i.e., $d = \widetilde{\Omega}(\varepsilon^{-(2 + 2p)/p})$, we show that for any (potentially randomized) algorithm, there exists a function with Lipschitz $p$-th order derivatives such that the algorithm requires at least $\varepsilon^{-(p+1)/p}$ iterations to find an $\varepsilon$-stationary point. Our lower bounds are tight and show that even with $\mathrm{poly}(d)$ queries per iteration, no algorithm has better convergence rate than those achievable with one-query-per-round algorithms. In other words, gradient descent, the cubic-regularized Newton's method, and the $p$-th order adaptive regularization method are adaptively optimal. Our proof relies upon novel analysis with the characterization of the output for the hardness potentials based on a chain-like structure with random partition. For the constant-dimensional case, i.e., $d = \Theta(1)$, we propose an algorithm that bridges grid search and gradient flow trapping, finding an approximate stationary point in constant iterations. Its asymptotic tightness is verified by a new lower bound on the required queries per iteration. We show there exists a smooth function such that any algorithm running with $\Theta(\log (1/\varepsilon))$ rounds requires at least $\widetilde{\Omega}((1/\varepsilon)^{(d-1)/2})$ queries per round. This lower bound is tight up to a logarithmic factor, and implies that the gradient flow trapping is adaptively optimal.

Subjects: Optimization and Control , Computational Complexity , Distributed, Parallel, and Cluster Computing

Publish: 2025-05-14 00:54:49 UTC


#13 Aging-Aware Battery Control via Convex Optimization [PDF] [Copy] [Kimi] [REL]

Authors: Obidike Nnorom Jr., Giray Ogut, Stephen Boyd, Philip Levis

We consider the task of controlling a battery while balancing two competing objectives that evolve over different time scales. The short-term objective, such as arbitrage or load smoothing, improves with more battery cycling, while the long-term objective is to maximize battery lifetime, which discourages cycling. Using a semi-empirical aging model, we formulate this problem as a convex optimization problem. We use model predictive control (MPC) with a convex approximation of aging dynamics to optimally manage the trade-off between performance and degradation. Through simulations, we quantify this trade-off in both economic and smoothing applications.

Subjects: Optimization and Control , Systems and Control

Publish: 2025-05-13 23:56:16 UTC


#14 Agency Problems and Adversarial Bilevel Optimization under Uncertainty and Cyber Threats [PDF] [Copy] [Kimi] [REL]

Authors: Thibaut Mastrolia, William Yan

We study an agency problem between a holding company and its subsidiary, exposed to cyber threats that affect the overall value of the subsidiary. The holding company seeks to design an optimal incentive scheme to mitigate these losses. In response, the subsidiary selects an optimal cybersecurity investment strategy, modeled through a stochastic epidemiological SIR (Susceptible-Infected-Recovered) framework. The cyber threat landscape is captured through an L-hop risk framework with two primary sources of risk: (i) internal risk propagation via the contagion parameters in the SIR model, and (ii) external cyberattacks from a malicious external hacker. The uncertainty and adversarial nature of the hacking lead to consider a robust stochastic control approach that allows for increased volatility and ambiguity induced by cyber incidents. The agency problem is formulated as a max-min bilevel stochastic control problem with accidents. First, we derive the incentive compatibility condition by reducing the subsidiary's optimal response to the solution of a second-order backward stochastic differential equation with jumps. Next, we demonstrate that the principal's problem can be equivalently reformulated as an integro-partial Hamilton-Jacobi-Bellman-Isaacs (HJBI) equation. By extending the stochastic Perron's method to our setting, we show that the value function of the problem is the unique viscosity solution to the resulting integro-partial HJBI equation.

Subjects: Optimization and Control , Probability

Publish: 2025-05-13 22:00:10 UTC


#15 SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures [PDF] [Copy] [Kimi] [REL]

Authors: Julian Kranz, Davide Gallon, Steffen Dereich, Arnulf Jentzen

We study gradient flows for loss landscapes of fully connected feed forward neural networks with commonly used continuously differentiable activation functions such as the logistic, hyperbolic tangent, softplus or GELU function. We prove that the gradient flow either converges to a critical point or diverges to infinity while the loss converges to an asymptotic critical value. Moreover, we prove the existence of a threshold $\varepsilon>0$ such that the loss value of any gradient flow initialized at most $\varepsilon$ above the optimal level converges to it. For polynomial target functions and sufficiently big architecture and data set, we prove that the optimal loss value is zero and can only be realized asymptotically. From this setting, we deduce our main result that any gradient flow with sufficiently good initialization diverges to infinity. Our proof heavily relies on the geometry of o-minimal structures. We confirm these theoretical findings with numerical experiments and extend our investigation to real-world scenarios, where we observe an analogous behavior.

Subjects: Machine Learning , Logic , Optimization and Control , Machine Learning

Publish: 2025-05-14 17:15:11 UTC


#16 Primal-dual splitting methods for phase-field surfactant model with moving contact lines [PDF] [Copy] [Kimi] [REL]

Authors: Wei Wu, Zhen Zhang, Chaozhen Wei

Surfactants have important effects on the dynamics of droplets on solid surfaces, which has inspired many industrial applications. Phase-field surfactant model with moving contact lines (PFS-MCL) has been employed to investigate the complex droplet dynamics with surfactants, while its numerical simulation remains challenging due to the coupling of gradient flows with respect to transport distances involving nonlinear and degenerate mobilities. We propose a novel structure-preserving variational scheme for PFS-MCL model with the dynamic boundary condition based on the minimizing movement scheme and optimal transport theory for Wasserstein gradient flows. The proposed scheme consists of a series of convex minimization problems and can be efficiently solved by our proposed primal-dual splitting method and its accelerated versions. By respecting the underlying PDE's variational structure with respect to the transport distance, the proposed scheme is proved to inherits the desirable properties including original energy dissipation, bound-preserving, and mass conservation. Through a suite of numerical simulations, we validate the performance of the proposed scheme and investigate the effects of surfactants on the droplet dynamics.

Subjects: Numerical Analysis , Optimization and Control , Computational Physics

Publish: 2025-05-14 15:20:53 UTC


#17 Absense of loops for the Wasserstein-$\mathcal{H}^1$ problem: the localization/blow-up argument [PDF] [Copy] [Kimi] [REL]

Author: João Miguel Machado

In the present work we prove that minimizers of the Wasserstein-$\mathscr{H}^1$ problem, introduced recently by Chambolle et. al., are trees in two cases: when the target measure is a sum of finitely many Dirac masses or when it has a bounded density.

Subjects: Analysis of PDEs , Optimization and Control

Publish: 2025-05-14 09:09:46 UTC


#18 Birch SGD: A Tree Graph Framework for Local and Asynchronous SGD Methods [PDF] [Copy] [Kimi] [REL]

Authors: Alexander Tyurin, Danil Sivtsov

We propose a new unifying framework, Birch SGD, for analyzing and designing distributed SGD methods. The central idea is to represent each method as a weighted directed tree, referred to as a computation tree. Leveraging this representation, we introduce a general theoretical result that reduces convergence analysis to studying the geometry of these trees. This perspective yields a purely graph-based interpretation of optimization dynamics, offering a new and intuitive foundation for method development. Using Birch SGD, we design eight new methods and analyze them alongside previously known ones, with at least six of the new methods shown to have optimal computational time complexity. Our research leads to two key insights: (i) all methods share the same "iteration rate" of $O\left(\frac{(R + 1) L \Delta}{\varepsilon} + \frac{\sigma^2 L \Delta}{\varepsilon^2}\right)$, where $R$ the maximum "tree distance" along the main branch of a tree; and (ii) different methods exhibit different trade-offs-for example, some update iterates more frequently, improving practical performance, while others are more communication-efficient or focus on other aspects. Birch SGD serves as a unifying framework for navigating these trade-offs. We believe these results provide a unified foundation for understanding, analyzing, and designing efficient asynchronous and parallel optimization methods.

Subjects: Machine Learning , Distributed, Parallel, and Cluster Computing , Optimization and Control

Publish: 2025-05-14 08:37:45 UTC


#19 Learning Cocoercive Conservative Denoisers via Helmholtz Decomposition for Poisson Inverse Problems [PDF] [Copy] [Kimi] [REL]

Authors: Deliang Wei, Peng Chen, Haobo Xu, Jiale Yao, Fang Li, Tieyong Zeng

Plug-and-play (PnP) methods with deep denoisers have shown impressive results in imaging problems. They typically require strong convexity or smoothness of the fidelity term and a (residual) non-expansive denoiser for convergence. These assumptions, however, are violated in Poisson inverse problems, and non-expansiveness can hinder denoising performance. To address these challenges, we propose a cocoercive conservative (CoCo) denoiser, which may be (residual) expansive, leading to improved denoising. By leveraging the generalized Helmholtz decomposition, we introduce a novel training strategy that combines Hamiltonian regularization to promote conservativeness and spectral regularization to ensure cocoerciveness. We prove that CoCo denoiser is a proximal operator of a weakly convex function, enabling a restoration model with an implicit weakly convex prior. The global convergence of PnP methods to a stationary point of this restoration model is established. Extensive experimental results demonstrate that our approach outperforms closely related methods in both visual quality and quantitative metrics.

Subjects: Computer Vision and Pattern Recognition , Machine Learning , Functional Analysis , Optimization and Control

Publish: 2025-05-13 19:00:55 UTC


#20 RDA-PSO: A computational method to quantify the diffusive dispersal of insects [PDF] [Copy] [Kimi] [REL]

Authors: Lidia Mrad, Joceline Lega

This article introduces a computational method, called "Recapture of Diffusive Agents & Particle Swarm Optimization" (RDA-PSO), designed to estimate the dispersal parameter of diffusive insects in mark-release-recapture (MRR) field experiments. In addition to describing the method, its properties are discussed, with particular focus on robustness in estimating the observed diffusion coefficient. It is shown that RDA-PSO provides a simple and reliable approach to quantify mosquito dispersal that outperforms other techniques based on the solution of the diffusion equation, three of which are also introduced in this work. Examples of application to both synthetic and real field data for the yellow fever mosquito are provided.

Subjects: Quantitative Methods , Optimization and Control , Populations and Evolution

Publish: 2025-05-13 15:46:37 UTC