Formalizing and Mitigating Structural Distortion in LLM Attention for Zero-Shot Graph Reasoning

Authors: Donald Loveland, Puja Trivedi, Ari Weinstein, Edward W Huang, Danai Koutra

Large Language Models (LLMs) have shown promise for reasoning over Text-Attributed Graphs (TAGs). However, applying LLMs to graphs requires linearizing their structure into sequences, introducing distortion rooted in the graph bandwidth problem. While this distortion has been shown to degrade performance, it is often attributed to prompt design or model scale, leaving the underlying mechanism unclear. In this work, we show how rotary positional embeddings turn graph linearization into bandwidth-dependent attention decay, suppressing attention between graph-adjacent nodes that are forced far apart in the serialized sequence. This shifts the focus of LLM-based graph reasoning from prompt engineering and scaling toward correcting attention misalignment. Motivated by this analysis, we propose Graph-aligned Language Attention (GaLA), a lightweight, inference-time modification for LLMs. GaLA biases attention toward graph-adjacent nodes while preserving the LLM's sequential inductive biases. Across TAG benchmarks, GaLA improves performance with negligible overhead, demonstrating that distortion is a correctable bottleneck in LLM-based graph reasoning.
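To make the bandwidth argument concrete, the following toy sketch may help. It is not the paper's method: the linear distance penalty is only a stand-in for the positional decay the abstract attributes to rotary embeddings, and the additive adjacency bias is a hypothetical illustration of "biasing attention toward graph-adjacent nodes", not GaLA's actual formulation. All function names and parameters (`bandwidth`, `attention_row`, `decay`, `bias`) are assumptions made for this example.

```python
import math


def bandwidth(edges, order):
    """Largest sequence distance between the endpoints of any edge
    when nodes are serialized in the given order."""
    pos = {node: i for i, node in enumerate(order)}
    return max(abs(pos[u] - pos[v]) for u, v in edges)


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_row(query_pos, order, edges, decay=0.5, bias=0.0):
    """Toy attention of one serialized node over all serialized nodes.

    score = -decay * |sequence distance|   (assumed stand-in for positional decay)
            + bias if the two nodes share an edge (hypothetical adjacency bias)
    """
    adjacent = {frozenset(e) for e in edges}
    query_node = order[query_pos]
    scores = []
    for key_pos, key_node in enumerate(order):
        score = -decay * abs(query_pos - key_pos)
        if frozenset((query_node, key_node)) in adjacent:
            score += bias
        scores.append(score)
    return softmax(scores)


# A 6-node cycle: 0-1-2-3-4-5-0.
edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (5, 0)]

# Naive serialization: the edge (5, 0) ends up five positions apart.
order = [0, 1, 2, 3, 4, 5]
# An interleaved serialization keeps every edge within two positions.
alt_order = [0, 1, 5, 2, 4, 3]

naive_bw = bandwidth(edges, order)
alt_bw = bandwidth(edges, alt_order)

# Attention from node 0 under the naive order, without and with the bias.
plain = attention_row(0, order, edges, bias=0.0)
biased = attention_row(0, order, edges, bias=2.0)

print("bandwidth (naive, interleaved):", naive_bw, alt_bw)
print("weight on neighbor 5, plain vs biased:", plain[5], biased[5])
```

In this toy setting, node 5 is a graph neighbor of node 0 but sits at the far end of the naive serialization, so its unbiased weight is the smallest in the row; adding the adjacency bias raises it without removing the distance term, which loosely mirrors the idea of correcting misalignment while keeping the sequential bias.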

Subject: Machine Learning

Publish: 2026-06-14 06:50:28 UTC

2606.15633
