AVFUoC53Lm@OpenReview

Total: 1

#1 Provable Length Generalization in Sequence Prediction via Spectral Filtering [PDF1] [Copy] [Kimi] [REL]

Authors: Annie Marsden, Evan Dogariu, Naman Agarwal, Xinyi Chen, Daniel Suo, Elad Hazan

We consider the problem of length generalization in sequence prediction. We define a new metric of performance in this setting – the Asymmetric-Regret– which measures regret against a benchmark predictor with longer context length than available to the learner. We continue by studying this concept through the lens of the spectral filter-ing algorithm. We present a gradient-based learn-ing algorithm that provably achieves length generalization for linear dynamical systems. We conclude with proof-of-concept experiments which are consistent with our theory.

Subject: ICML.2025 - Poster