H-Probes: Extracting Hierarchical Structures From Latent Representations of Language Models

#1 H-Probes: Extracting Hierarchical Structures From Latent Representations of Language Models [PDF¹] [Copy] [Kimi] [REL]

Authors: Cutter Dawes, Aryan Sharma, Angelos Ioannis Lagos, Shivam Raval

Representing and navigating hierarchy is a fundamental primitive of reasoning. Large language models have demonstrated proficiency in a wide variety of tasks requiring hierarchical reasoning, but there exists limited analysis on how the models geometrically represent the necessary latent constructions for such thinking. To this end, we develop \textit{H-probes}, a collection of linear probes that extract hierarchical structure, specifically depth and pairwise distance, from latent representations. In synthetic tree traversal tasks, the H-probes robustly find the subspaces containing hierarchical structure necessary to complete the tasks; furthermore, in comprehensive ablation experiments, we show that these hierarchy-containing subspaces are low-dimensional, causally important for high task performance, and generalize within- and out-of-domain. Furthermore, we find analogous, though weaker, hierarchical structure in real-world hierarchical contexts such as mathematical reasoning traces. These results demonstrate that models represent hierarchy not only at the level of syntax and concepts, but at deeper levels of abstraction -- including the reasoning process itself.

Subjects: Computation and Language , Artificial Intelligence , Machine Learning

Publish: 2026-04-15 00:59:17 UTC

2605.00847

#1 H-Probes: Extracting Hierarchical Structures From Latent Representations of Language Models [PDF1] [Copy] [Kimi] [REL]

#1 H-Probes: Extracting Hierarchical Structures From Latent Representations of Language Models [PDF¹] [Copy] [Kimi] [REL]