High-Order Error Bounds for Markovian LSA with Richardson–Romberg Extrapolation

40994@AAAI

Authors: Ilya Levin, Alexey Naumov, Sergey Samsonov

In this paper, we study the bias and high-order error bounds of the Linear Stochastic Approximation (LSA) algorithm with Polyak-Ruppert (PR) averaging under Markovian noise. We focus on the constant-step-size version of the algorithm and propose a novel decomposition of the bias via a linearization technique. We analyze the structure of the bias and show that its leading-order term is linear in the step size and cannot be eliminated by PR averaging. To address this, we apply the Richardson-Romberg (RR) extrapolation procedure, which cancels the leading bias term. We derive high-order moment bounds for the RR iterates and show that the leading error term aligns with the asymptotically optimal covariance matrix of the vanilla averaged LSA iterates. We validate the applicability of our findings to the temporal difference algorithm in reinforcement learning.
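
The procedure described in the abstract can be illustrated with a minimal sketch; it is not the authors' code. It assumes the common form of RR extrapolation, in which two averaged LSA runs with step sizes α and 2α are driven by the same noise sequence and combined as 2·θ̄(α) − θ̄(2α); the abstract itself does not spell out the combination. The scalar problem, the two-state Markov chain, and all names (`simulate_chain`, `averaged_lsa`, `a`, `b`, `stay_prob`) are hypothetical choices made only for illustration.

```python
import random


def simulate_chain(n_steps, stay_prob, rng):
    """Sample a symmetric two-state Markov chain on {0, 1} (toy noise model)."""
    z = 0
    path = []
    for _ in range(n_steps):
        if rng.random() > stay_prob:  # switch state with probability 1 - stay_prob
            z = 1 - z
        path.append(z)
    return path


def averaged_lsa(path, a, b, step, theta0=0.0):
    """Scalar constant-step LSA with Polyak-Ruppert averaging.

    Update: theta <- theta - step * (a[z] * theta - b[z]), where z is the
    current Markov state; the function returns the average of the iterates.
    """
    theta = theta0
    running_sum = 0.0
    for z in path:
        theta -= step * (a[z] * theta - b[z])
        running_sum += theta
    return running_sum / len(path)


# Hypothetical toy problem: the stationary law is uniform on {0, 1}, so the
# mean system is a_bar = 1.0, b_bar = 1.0 and the target is theta_star = 1.0.
a = [0.5, 1.5]
b = [0.0, 2.0]
theta_star = 1.0

rng = random.Random(0)
path = simulate_chain(200_000, stay_prob=0.9, rng=rng)

step = 0.05
pr_small = averaged_lsa(path, a, b, step)      # PR average with step alpha
pr_large = averaged_lsa(path, a, b, 2 * step)  # PR average with step 2 * alpha
rr = 2 * pr_small - pr_large                   # RR combination (assumed form)

print("PR error, step alpha  :", abs(pr_small - theta_star))
print("PR error, step 2*alpha:", abs(pr_large - theta_star))
print("RR error              :", abs(rr - theta_star))
```

Both runs reuse the same `path`, so their fluctuations are strongly correlated and the combination mainly removes the part of the bias that is linear in the step size. In this toy setting the error of the PR average should grow roughly in proportion to the step size, while the RR estimate should land noticeably closer to `theta_star`.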

Subject: AAAI.2026 - Reasoning under Uncertainty
