2504.04600

Total: 1

#1 Capturing AI's Attention: Physics of Repetition, Hallucination, Bias and Beyond [PDF1] [Copy] [Kimi3] [REL]

Authors: Frank Yingjie Huo, Neil F. Johnson

We derive a first-principles physics theory of the AI engine at the heart of LLMs' 'magic' (e.g. ChatGPT, Claude): the basic Attention head. The theory allows a quantitative analysis of outstanding AI challenges such as output repetition, hallucination and harmful content, and bias (e.g. from training and fine-tuning). Its predictions are consistent with large-scale LLM outputs. Its 2-body form suggests why LLMs work so well, but hints that a generalized 3-body Attention would make such AI work even better. Its similarity to a spin-bath means that existing Physics expertise could immediately be harnessed to help Society ensure AI is trustworthy and resilient to manipulation.

Subjects: Artificial Intelligence , Other Condensed Matter , Mathematical Physics , Adaptation and Self-Organizing Systems , Physics and Society

Publish: 2025-04-06 20:10:05 UTC