Model-free stochastic linear quadratic control for discrete-time systems with multiplicative and additive noises via semidefinite programming

#1 Model-free stochastic linear quadratic control for discrete-time systems with multiplicative and additive noises via semidefinite programming [PDF] [Copy] [Kimi] [REL]

Authors: Jing Guo, Xiushan Jiang, Weihai Zhang

This paper investigates a model-free solution to the stochastic linear quadratic regulation (LQR) problem for linear discrete-time systems with both multiplicative and additive noises. We formulate the stochastic LQR problem as a nonconvex optimization problem and rigorously analyze its dual problem structure. By exploiting the inherent convexity of the dual problem and analyzing Karush-Kuhn-Tucker conditions with respect to optimality in convex optimization, we establish an explicit relationship between the optimal point of the dual problem and the parameters of the associated Q-function. This theoretical insight, combined with the technique of the matrix direct sum, makes it possible to develop a novel model-free sample-efficient, non-iterative semidefinite programming algorithm that directly estimates optimal control gain without requiring an initial stabilizing controller, or noises measurability. The robustness of the model-free SDP method to errors is investigated. Our approach provides a new optimization-theoretic framework for understanding Q-learning algorithms while advancing the theoretical foundations of reinforcement learning in stochastic optimal control. Numerical validation on a pulse-width modulated inverter system demonstrates the algorithm's effectiveness, particularly in achieving a single-step non-iterative solution without hyper-parameter tuning.

Subject: Optimization and Control

Publish: 2025-12-24 03:31:48 UTC

2512.20911

#1 Model-free stochastic linear quadratic control for discrete-time systems with multiplicative and additive noises via semidefinite programming [PDF] [Copy] [Kimi] [REL]