https://papers.cool/arxiv/math.OCOptimization and Control2024-08-15T00:00:00+00:00python-feedgenCool Papers - Immersive Paper Discoveryhttps://papers.cool/arxiv/2408.07086Quantum algorithms for optimizers2024-08-15T00:00:00+00:00Giacomo NanniciniThis is a set of lecture notes for a Ph.D.-level course on quantum algorithms, with an emphasis on quantum optimization algorithms. It is developed for applied mathematicians and engineers, and requires no previous background in quantum mechanics. The main topics of this course, in addition to a rigorous introduction to the computational model, are: input/output models, quantum search, the quantum gradient algorithm, matrix manipulation algorithms, the matrix multiplicative weights update framework for semidefinite optimization, adiabatic optimization.https://papers.cool/arxiv/2408.07192Solving Truly Massive Budgeted Monotonic POMDPs with Oracle-Guided Meta-Reinforcement Learning2024-08-15T00:00:00+00:00Manav VoraMichael N GrussingMelkior OrnikMonotonic Partially Observable Markov Decision Processes (POMDPs), where the system state progressively decreases until a restorative action is performed, can be used to model sequential repair problems effectively. This paper considers the problem of solving budget-constrained multi-component monotonic POMDPs, where a finite budget limits the maximal number of restorative actions. For a large number of components, solving such a POMDP using current methods is computationally intractable due to the exponential growth in the state space with an increasing number of components. To address this challenge, we propose a two-step approach. Since the individual components of a budget-constrained multi-component monotonic POMDP are only connected via the shared budget, we first approximate the optimal budget allocation among these components using an approximation of each component POMDP's optimal value function which is obtained through a random forest model. 
Subsequently, we introduce an oracle-guided meta-trained Proximal Policy Optimization (PPO) algorithm to solve each of the independent budget-constrained single-component monotonic POMDPs. The oracle policy is obtained by performing value iteration on the corresponding monotonic Markov Decision Process (MDP). This two-step method provides scalability in solving truly massive multi-component monotonic POMDPs. To demonstrate the efficacy of our approach, we consider a real-world maintenance scenario that involves inspection and repair of an administrative building by a team of agents within a maintenance budget. Finally, we perform a computational complexity analysis for a varying number of components to show the scalability of the proposed approach.https://papers.cool/arxiv/2408.07431Strategies for optimizing double-bracket quantum algorithms2024-08-15T00:00:00+00:00Li XiaoyueMatteo RobbiatiAndrea PasqualeEdoardo PedicilloAndrew WrightStefano CarrazzaMarek GluzaRecently double-bracket quantum algorithms have been proposed as a way to compile circuits for approximating eigenstates. Physically, they consist of appropriately composing evolutions under an input Hamiltonian together with diagonal evolutions. Here, we present strategies to optimize the choice of the double-bracket evolutions to enhance the diagonalization efficiency. This can be done by finding optimal generators and durations of the evolutions. We present numerical results regarding the preparation of double-bracket iterations, both in ideal cases where the algorithm's setup provides analytical convergence guarantees and in more heuristic cases, where we use an adaptive and variational approach to optimize the generators of the evolutions. As an example, we discuss the efficacy of these optimization strategies when considering a spin-chain Hamiltonian as the target. 
To propose algorithms that can be executed starting today, fully aware of the limitations of the quantum technologies at our disposal, we finally present a selection of diagonal evolution parametrizations that can be directly compiled into CNOTs and single-qubit rotation gates. We discuss the advantages and limitations of this compilation and propose a way to take advantage of this approach when used in synergy with other existing methods.https://papers.cool/arxiv/2408.07568Steady-State Cascade Operators and their Role in Linear Control, Estimation, and Model Reduction Problems2024-08-15T00:00:00+00:00John W. Simpson-PorcoDaniele AstolfiGiordano ScarciottiCertain linear matrix operators arise naturally in systems analysis and design problems involving cascade interconnections of linear time-invariant systems, including problems of stabilization, estimation, and model order reduction. We conduct here a comprehensive study of these operators and their relevant system-theoretic properties. The general theory is then leveraged to delineate both known and new design methodologies for control, estimation, and model reduction. Several entirely new designs arise from this systematic categorization, including new recursive and low-gain design frameworks for observation of cascaded systems. The benefits of the results beyond the linear time-invariant setting are demonstrated through preliminary extensions for nonlinear systems, with an outlook towards the development of a similarly comprehensive nonlinear theory.https://papers.cool/arxiv/2408.07616Prophet Inequalities: Competing with the Top $\ell$ Items is Easy2024-08-15T00:00:00+00:00Mathieu MolinaNicolas GastPatrick LoiseauVianney PerchetWe explore a novel variant of the classical prophet inequality problem, where the values of a sequence of items are drawn i.i.d. from some distribution, and an online decision maker must select one item irrevocably. 
We establish that the competitive ratio between the expected optimal performance of the online decision maker compared to that of a prophet, who uses the average of the top $\ell$ items, must be greater than $\ell/c_{\ell}$, with $c_{\ell}$ the solution to an integral equation. We prove that this lower bound is larger than $1-1/(\exp(\ell)-1)$. This implies that the bound converges exponentially fast to $1$ as $\ell$ grows. In particular, the bound for $\ell=2$ is $2/c_{2} \approx 0.966$ which is much closer to $1$ than the classical bound of $0.745$ for $\ell=1$. Additionally, the proposed algorithm can be extended to a more general scenario, where the decision maker is permitted to select $k$ items. This subsumes the $k$ multi-unit i.i.d. prophet problem, provides the current best asymptotic guarantees, and enables a broader understanding in the more general framework. Finally, we prove a nearly tight competitive ratio when only static threshold policies are allowed.https://papers.cool/arxiv/2408.07143Optimal Experimental Design for Universal Differential Equations2024-08-15T00:00:00+00:00Christoph PlateCarl Julius MartensenSebastian SagerComplex dynamic systems are typically either modeled using expert knowledge in the form of differential equations, or via data-driven universal approximation models such as artificial neural networks (ANN). While the first approach has advantages with respect to interpretability, transparency, data-efficiency, and extrapolation, the second approach is able to learn completely unknown functional relations from data and may result in models that can be evaluated more efficiently. To combine the complementary advantages, universal differential equations (UDE) have been suggested, which replace unknown terms in the differential equations with ANN. These hybrid models make it possible both to encode prior domain knowledge such as first principles and to learn unknown mechanisms from data. 
Often, data for the training of UDE can only be obtained via costly experiments. We consider optimal experimental design (OED) for the planning of experiments and generation of data needed to train UDE. The number of weights in the embedded ANN usually leads to an overfitting of the regression problem. To make the OED problem tractable for optimization, we propose and compare dimension reduction methods that are based on lumping of weights and singular value decomposition of the Fisher information matrix (FIM), respectively. They result in lower-dimensional variational differential equations which are easier to solve and which yield regular FIM. Our numerical results showcase the advantages of OED for UDE, such as an increased data-efficiency and better extrapolation properties.https://papers.cool/arxiv/2408.07182Proximal random reshuffling under local Lipschitz continuity2024-08-15T00:00:00+00:00Cedric JoszLexiao LaiXiaopeng LiWe study proximal random reshuffling for minimizing the sum of locally Lipschitz functions and a proper lower semicontinuous convex function without assuming coercivity or the existence of limit points. The algorithmic guarantees pertaining to near approximate stationarity rely on a new tracking lemma linking the iterates to trajectories of conservative fields. One of the novelties in the analysis consists in handling conservative fields with unbounded values.https://papers.cool/arxiv/2408.07206Equivalence of Dubins Path on Sphere with Geographic Coordinates and Moving Frames2024-08-15T00:00:00+00:00Deepak Prakash KumarSwaroop DarbhaSatyanarayana G. ManyamDavid W. CasbeerMeir PachterIn this article, two methods of addressing path planning for a Dubins vehicle moving on a sphere are considered, wherein either spherical coordinates or a moving frame are considered to describe the vehicle's motion. 
The primary contribution of this article is to show the equivalence of these two approaches, which in turn shows that the results known for the moving frame-based description transfer to the model utilizing spherical coordinates.https://papers.cool/arxiv/2408.07235Variational Analysis of Proximal Compositions and Integral Proximal Mixtures2024-08-15T00:00:00+00:00Patrick L. CombettesDiego J. CornejoThis paper establishes various variational properties of parametrized versions of two convexity-preserving constructs that were recently introduced in the literature: the proximal composition of a function and a linear operator, and the integral proximal mixture of arbitrary families of functions and linear operators. We study in particular convexity, Legendre conjugacy, differentiability, Moreau envelopes, coercivity, minimizers, recession functions, and perspective functions of these constructs, as well as their asymptotic behavior as the parameter varies. The special case of the proximal expectation of a family of functions is also discussed.https://papers.cool/arxiv/2408.07252Active vibration control of nonlinear flexible structures via reduction on spectral submanifolds2024-08-15T00:00:00+00:00Cong ShenMingwu LiLarge amplitude vibrations can cause hazards and failure to engineering structures. Active control has been an effective strategy to suppress vibrations, but it faces great challenges in the real-time control of nonlinear flexible structures. Here, we present a control design framework using reductions on aperiodic spectral submanifolds (SSMs) to address the challenges. We formulate high-dimensional nonlinear optimal control problems to suppress the vibrations and then use the SSM-based reductions to transform the original optimal control problems into low-dimensional linear optimal control problems. 
We further establish extended linear quadratic regulators to solve the reduced optimal control problems, paving the road for real-time active control of nonlinear flexible structures. We demonstrate the effectiveness of our control design framework via a suite of examples with increasing complexity, including a finite element model of an aircraft wing with more than 130,000 degrees of freedom.https://papers.cool/arxiv/2408.07256On the local and global minimizers of the smooth stress function in Euclidean Distance Matrix problems2024-08-15T00:00:00+00:00Mengmeng SongDouglas GoncalvesWoosuk L. JungCarlile LavorAntonio MucherinoHenry WolkowiczWe consider the nonconvex minimization problem, with quartic objective function, that arises in the exact recovery of a configuration matrix $P\in \mathbb{R}^{n\times d}$ of $n$ points when a Euclidean distance matrix, EDM, is given with embedding dimension $d$. It is an open question in the literature under which conditions such a minimization problem admits a local nonglobal minimizer, lngm. We prove that all second order stationary points are global minimizers whenever $n \leq d + 1$. For $n > d+1$, we numerically find a local nonglobal minimum and show analytically that there indeed exists a nearby lngm for the underlying quartic minimization problem. Thus, we answer in the affirmative the previously open question about their existence. Our approach to finding the lngm is novel in that we first exploit the translation and rotation invariance to reduce the size of the problem from $nd$ variables in $P$ to $(n-1)d - d(d-1)/2 = d(2n-d-1)/2$ variables. 
This allows for finding examples that satisfy the strict second order sufficient optimality conditions.https://papers.cool/arxiv/2408.07268Fast Unconstrained Optimization via Hessian Averaging and Adaptive Gradient Sampling Methods2024-08-15T00:00:00+00:00Thomas O'Leary-RoseberryRaghu BollapragadaWe consider minimizing finite-sum and expectation objective functions via Hessian-averaging based subsampled Newton methods. These methods allow for gradient inexactness and have fixed per-iteration Hessian approximation costs. Recent work (Na et al. 2023) demonstrated that Hessian averaging can be utilized to achieve fast $\mathcal{O}\left(\sqrt{\tfrac{\log k}{k}}\right)$ local superlinear convergence for strongly convex functions in high probability, while maintaining fixed per-iteration Hessian costs. These methods, however, require gradient exactness and strong convexity, which poses challenges for their practical implementation. To address this concern we consider Hessian-averaged methods that allow gradient inexactness via norm condition based adaptive-sampling strategies. For the finite-sum problem we utilize deterministic sampling techniques which lead to global linear and sublinear convergence rates for strongly convex and nonconvex functions respectively. In this setting we are able to derive an improved deterministic local superlinear convergence rate of $\mathcal{O}\left(\tfrac{1}{k}\right)$. For the expectation problem we utilize stochastic sampling techniques, and derive global linear and sublinear rates for strongly convex and nonconvex functions, as well as an $\mathcal{O}\left(\tfrac{1}{\sqrt{k}}\right)$ local superlinear convergence rate, all in expectation. We present novel analysis techniques that differ from the previous probabilistic results. Additionally, we propose scalable and efficient variations of these methods via diagonal approximations and derive the novel diagonally-averaged Newton (Dan) method for large-scale problems. 
Our numerical results demonstrate that the Hessian averaging not only helps with convergence, but can lead to state-of-the-art performance on difficult problems such as CIFAR100 classification with ResNets.https://papers.cool/arxiv/2408.07288A Modeling Framework for Equitable Deployment of Energy Storage in Disadvantaged Communities2024-08-15T00:00:00+00:00Miguel HelenoPaul LesurAlexandre MoreiraThis paper provides an analytical framework to incorporate the deployment of behind-the-meter energy storage coupled with rooftop solar, and their associated revenue streams, in the context of equitable energy policy interventions. We propose an extension to the Justice40 optimization model by adding storage and incorporating more realistic solar compensation mechanisms, such as net-billing, which allows for temporal revenue differentiation and the economic viability of behind-the-meter energy storage devices. The extended model includes household-level PV plus storage co-deployment alongside existing interventions, such as weatherization, rooftop PV only, community solar, and community wind. From a modeling perspective, we propose a novel approximation method to represent storage operations and revenue streams without expanding the temporal dimension of the model, thus maintaining its computational efficiency. The proposed model is validated using a case study in Wayne County, Michigan, involving 3,651 energy-insecure households.https://papers.cool/arxiv/2408.07382Multi-Phase Optimal Control Problems for Efficient Nonlinear Model Predictive Control with acados2024-08-15T00:00:00+00:00Jonathan FreyKatrin BaumgärtnerGianluca FrisonMoritz DiehlComputationally efficient nonlinear model predictive control relies on elaborate discrete-time optimal control problem (OCP) formulations trading off accuracy with respect to the continuous-time problem and associated computational burden. 
Such formulations, however, are in general not easy to implement within specialized software frameworks tailored to numerical optimal control. This paper introduces a new multi-phase OCP interface for the open-source software acados, which makes it convenient to formulate such problems and to generate fast solvers that can be used for nonlinear model predictive control (NMPC). While multi-phase OCP (MOCP) formulations occur naturally in many applications, this work focuses on MOCP formulations that can be used to efficiently approximate standard continuous-time OCPs in the context of NMPC. To this end, the paper discusses advanced control parametrizations, such as closed-loop costing and piecewise polynomials with varying degree, as well as partial tightening and formulations that leverage models of different fidelity. An introductory example is presented to showcase the usability of the new interface. Finally, three numerical experiments demonstrate that NMPC controllers based on multi-phase formulations can efficiently trade off computation time and control performance.https://papers.cool/arxiv/2408.07386Fading memory and the convolution theorem2024-08-15T00:00:00+00:00Juan-Pablo OrtegaFlorian RossmannekSeveral topological and analytical notions of continuity and fading memory for causal and time-invariant filters are introduced, and the relations between them are analysed. A significant generalization of the convolution theorem that establishes the equivalence between the fading memory property and the availability of convolution representations of linear filters is proved. This result extends a previous such characterization to a complete array of weighted norms in the definition of the fading memory property. 
Additionally, the main theorem shows that the availability of convolution representations can be characterized, at least when the codomain is finite-dimensional, not only by the fading memory property but also by the reunion of two purely topological notions that are called minimal continuity and minimal fading memory property. Finally, when the input space and the codomain of a linear functional are Hilbert spaces, it is shown that minimal continuity and the minimal fading memory property guarantee the existence of interesting embeddings of the associated reproducing kernel Hilbert spaces and approximation results of solutions of kernel regressions in the presence of finite data sets.https://papers.cool/arxiv/2408.07450Dynamic Pickup-and-Delivery for Collaborative Platforms with Time-Dependent Travel and Crowdshipping2024-08-15T00:00:00+00:00Sara StoiaDemetrio LaganàJeffrey W. OhlmannWe study a pickup-and-delivery problem that arises when customers randomly submit requests over the course of a day from a choice of vendors on a collaborative e-commerce portal. Based on the attributes of a customer request, a dispatcher dynamically schedules the delivery service on either a dedicated vehicle or a crowdshipper, both of whom experience time dependent travel times. While dedicated vehicles are available throughout the day, the availability of crowdshippers is unknown a priori and they appear randomly for only portions of the day. With an objective of minimizing the sum of routing costs, piece-rate crowdshipper payments, and lateness charges, we model the uncertainty in request arrivals and crowdshipper appearances as a Markov decision process. To determine an action at each decision epoch, we employ a heuristic that partially destroys the existing routes and repairs them guided by a parameterized cost function approximation that accounts for the remaining temporal capacity of delivery vehicles. 
Through a set of computational experiments, we demonstrate the improvement of our approach over a myopic approach in several key performance metrics. In addition, we conduct computational experiments that demonstrate the impact of inserting wait time in the route scheduling and the benefit of explicitly modeling time-dependent travel times. Through our computational testing, we also investigate the effect of demand management mechanisms that facilitate many-to-one request bundles or one-to-many request bundles on reducing the cost to service requests.https://papers.cool/arxiv/2408.07503Faster Stochastic Optimization with Arbitrary Delays via Asynchronous Mini-Batching2024-08-15T00:00:00+00:00Amit AttiaOfir GaashTomer KorenWe consider the problem of asynchronous stochastic optimization, where an optimization algorithm makes updates based on stale stochastic gradients of the objective that are subject to an arbitrary (possibly adversarial) sequence of delays. We present a procedure which, for any given $q \in (0,1]$, transforms any standard stochastic first-order method to an asynchronous method with convergence guarantee depending on the $q$-quantile delay of the sequence. This approach leads to convergence rates of the form $O(\tau_q/qT+\sigma/\sqrt{qT})$ for non-convex and $O(\tau_q^2/(q T)^2+\sigma/\sqrt{qT})$ for convex smooth problems, where $\tau_q$ is the $q$-quantile delay, generalizing and improving on existing results that depend on the average delay. We further show a method that automatically adapts to all quantiles simultaneously, without any prior knowledge of the delays, achieving convergence rates of the form $O(\inf_{q} \tau_q/qT+\sigma/\sqrt{qT})$ for non-convex and $O(\inf_{q} \tau_q^2/(q T)^2+\sigma/\sqrt{qT})$ for convex smooth problems. 
Our technique is based on asynchronous mini-batching with a careful batch-size selection and filtering of stale gradients.https://papers.cool/arxiv/2408.07688Finite Dimensional Projections of HJB Equations in the Wasserstein Space2024-08-15T00:00:00+00:00Andrzej ŚwięchLukas WesselsThis paper continues the study of controlled interacting particle systems with common noise started in [W. Gangbo, S. Mayorga and A. Święch, SIAM J. Math. Anal. 53 (2021), no. 2, 1320–1356] and [S. Mayorga and A. Święch, SIAM J. Control Optim. 61 (2023), no. 2, 820–851]. First, we extend the following results of the previously mentioned works to the case of multiplicative noise: (i) We generalize the convergence of the value functions $u_n$ corresponding to control problems of $n$ particles to the value function $V$ corresponding to an appropriately defined infinite dimensional control problem; (ii) we prove, under certain additional assumptions, $C^{1,1}$ regularity of $V$ in the spatial variable. The second main contribution of the present work is the proof that if $DV$ is continuous (which, in particular, includes the previously proven case of $C^{1,1}$ regularity in the spatial variable), the value function $V$ projects precisely onto the value functions $u_n$. Using this projection property, we show that optimal controls of the finite dimensional problem correspond to optimal controls of the infinite dimensional problem and vice versa. In the case of a linear state equation, we are able to prove that $V$ projects precisely onto the value functions $u_n$ under relaxed assumptions on the coefficients of the cost functional by using approximation techniques in the Wasserstein space, thus covering cases where $V$ may not be differentiable.