LLMs as High-Dimensional Nonlinear Autoregressive Models with Attention: Training, Alignment and Inference

Large language models (LLMs) based on transformer architectures are typically described through collections of architectural components and training procedures, obscuring their underlying computational structure. This review article provides a concise mathematical reference for researchers seeking an explicit, equation-level description of LLM training, alignment, and generation. We formulate LLMs as high-dimensional nonlinear autoregressive models with attention-based dependencies. The framework encompasses pretraining via next-token prediction, alignment methods such as reinforcement learning from human feedback (RLHF), direct preference optimization (DPO), rejection sampling fine-tuning (RSFT), and reinforcement learning from verifiable rewards (RLVR), as well as autoregressive generation during inference. Self-attention emerges naturally as a repeated bilinear-softmax-linear composition, yielding highly expressive sequence models. This formulation enables principled analysis of alignment-induced behaviors (including sycophancy), inference-time phenomena (such as hallucination, in-context learning, chain-of-thought prompting, and retrieval-augmented generation), and extensions like continual learning, while serving as a concise reference for interpretation and further theoretical development.
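
To make the "bilinear-softmax-linear" description concrete, here is a minimal sketch of one causal self-attention step in plain Python. It is an illustration under stated assumptions, not the paper's formulation: the function names (`softmax`, `matvec`, `causal_attention`) and the weight matrices `Wq`, `Wk`, `Wv` are hypothetical, and the sketch uses a single head with no positional encoding, residual connection, or normalization.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def matvec(W, x):
    # Matrix-vector product; W is a list of rows.
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def causal_attention(X, Wq, Wk, Wv):
    """Single-head causal self-attention (illustrative sketch only).

    X is a list of T token vectors. Returns (outputs, weights), where
    weights[t] holds the attention over positions 0..t.
    """
    Q = [matvec(Wq, x) for x in X]
    K = [matvec(Wk, x) for x in X]
    V = [matvec(Wv, x) for x in X]
    scale = 1.0 / math.sqrt(len(Q[0]))
    outputs, weights = [], []
    for t in range(len(X)):
        # Bilinear step: score(t, s) is bilinear in (x_t, x_s); only s <= t is visible.
        scores = [scale * sum(q * k for q, k in zip(Q[t], K[s]))
                  for s in range(t + 1)]
        # Softmax step: normalize the scores into a distribution over the past.
        a = softmax(scores)
        # Linear step: output is a weighted sum of value vectors.
        out = [sum(a[s] * V[s][j] for s in range(t + 1))
               for j in range(len(V[0]))]
        outputs.append(out)
        weights.append(a)
    return outputs, weights

# Toy usage with identity projections (assumed values, for illustration).
I2 = [[1.0, 0.0], [0.0, 1.0]]
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = causal_attention(X, I2, I2, I2)
```

Because position t only attends to positions up to t, the output at each step depends on the past alone, which is what allows the model to be read as an autoregressive map from earlier tokens to the next-token representation.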

Subjects: Machine Learning, Artificial Intelligence, Computation and Language, Signal Processing

Published: 2026-01-31 00:37:53 UTC

2602.00426
