LSEAttention is All You Need for Time Series Forecasting | Cool Papers

#1 LSEAttention is All You Need for Time Series Forecasting [PDF³] [Copy] [Kimi⁴] [REL]

Transformer-based architectures have achieved remarkable success in natural language processing and computer vision. However, their performance in multivariate long-term forecasting often lags behind simpler linear baselines. Previous studies have identified the traditional attention mechanism as a significant factor contributing to this limitation. To unlock the full potential of transformers for multivariate time series forecasting, I introduce \textbf{LSEAttention}, an approach designed to address entropy collapse and training instability commonly observed in transformer models. I validate the effectiveness of LSEAttention across various real-world multivariate time series datasets, demonstrating that it not only outperforms existing time series transformer models but also exceeds the performance of some state-of-the-art models on specific datasets.

Subjects: Machine Learning , Artificial Intelligence

Publish: 2024-10-31 09:09:39 UTC

2410.23749

#1 LSEAttention is All You Need for Time Series Forecasting [PDF3] [Copy] [Kimi4] [REL]

#1 LSEAttention is All You Need for Time Series Forecasting [PDF³] [Copy] [Kimi⁴] [REL]